(57) Abstract

Genetic polymorphisms are identified in the human UGT1 gene that alter UGT1-dependent drug metabolism. Nucleic acids comprising
the polymorphic sequences are used to screen patients for altered metabolism for UGT1 substrates, potential drug-drug interactions, and
adverse/side effects, as well as diseases that result from environmental or occupational exposure to toxins. The nucleic acids are used to
establish animal, cell and in vitro models for drug metabolism.
GENOTYPING THE
HUMAN UDP-GLUCURONOSYLTRANSFERASE 1 (UGT1) GENE

INTRODUCTION

The metabolic processes commonly involved in the biotransformation of xenobiotics
have been classified into functionalization reactions (phase I reactions), in which lipophilic
compounds are modified via monooxygenation, dealkylation, reduction, aromatization, or
hydrolysis. These modified molecules can then be substrates for the phase II reactions,
often called conjugation reactions, as they conjugate a functional group with a polar,
endogenous compound. Drug glucuronidation, a major phase II conjugation reaction in the
mammalian detoxification system, is catalyzed by the UDP-glucuronosyltransferases
(UGTs) (Batt AM, et al. (1994) Clin Chim Acta 226:171-190; Burchell et al. (1995) Life Sci.
57:1819-31).

The UGTs are a family of enzymes that catalyze the glucuronic acid conjugation of
a wide range of endogenous and exogenous substrates including phenols, alcohols,
amines and fatty acids. The reactions catalyzed by UGTs permit the conversion of a large
range of toxic endogenous/xenobiotic compounds to more water-soluble forms for
subsequent excretion (Parkinson A (1996) Toxicol Pathol 24:48-57).

The UGT isoenzymes are located primarily in hepatic endoplasmic reticulum and
nuclear envelope (Parkinson A (1996) Toxicol Pathol 24:48-57), though they are also
expressed in other tissues such as kidney and skin. UGTs are encoded by a large
multigene superfamily that has evolved to produce catalysts with differing but overlapping
substrate specificities. Three families, UGT1, UGT2, and UGT8, have been identified
within the superfamily. UGTs are assigned to one of the subfamilies based on amino acid
sequence identity, e.g., UGT1 family members have greater than 45% amino acid
sequence identity (Mackenzie et al. (1997) Pharmacogenetics 7:255-69).

The UGT1 locus is located on chromosome 2q37, and contains at least 12
promoters/first exons, which are apparently able to splice with common exons 2 through 5,
producing gene products having strikingly different N-terminal halves (amino acid sequence
identities ranging from 24% to 49%), but identical C-terminal halves (Fig. 1). At least eight
different isoenzymes are encoded by the UGT1 locus; at least one or more first exons
encode pseudogenes. The different N-terminal halves encoded by the first exons confer
different substrate binding specificities upon the UGT1 isoenzymes, while exons 2-5, which
are present in all UGT1 isoenzyme mRNAs, encode the UDP-glucuronic acid binding
domain, membrane anchorage site, and ER retention signal that are common to all UGT
proteins (Ritter et al. (1992) J Biol Chem 267:3257-3261). UGT1 locus isoenzymes are
best known for their role in glucuronidation and metabolism of many substrates, including
bilirubin (1A1, 1D1), planar and non-planar phenols, naphthols (1F1) (Ouzzine M, et al.
(1994) Arch Biochem Biophys 310:196-204), anthraquinones, flavones, aliphatic alcohols,
aromatic carboxylic acids, and steroids (Ebner T, et al. (1993) Drug Metab Dispos 21:50-55).

In addition to UGT1 exon usage, metabolism of endogenous and exogenous
substrates can also be affected by competitive binding phenomena. In some
cases exogenous substrates for the UGT1 enzymes have a higher binding affinity or avidity
for the enzyme than the endogenous UGT1 substrates. For example, UGT1*1, the major
bilirubin-metabolizing form of UGT1, more readily binds both octyl-gallate and emodin than
it binds bilirubin, thus indicating the potential of these xenobiotics to cause jaundice by
inhibition of bilirubin binding to UGT1*1 (where 1*1 indicates that the first exon is used in
the spliced gene product). UGT1*1 is also responsible for glucuronidation of the oral
contraceptive ethinylestradiol (Ebner et al. (1993) Mol. Pharmacol. 43:649-54), and can
also glucuronidate phenols, anthraquinones, flavones, and certain endogenous steroids.

As noted above, the first exon present in UGT1 can affect substrate binding
specificity of the UGT1 gene product (for a review, see Burchell (1995) Life Sci. 57:1819-31).
For example, UGT1*2 accepts a wide range of compounds as substrates including
non-planar phenols, anthraquinones, flavones, aliphatic alcohols, aromatic carboxylic acids,
steroids (4-hydroxyestrone, estrone) and many drugs of varied structure (Ebner et al.
(1993) Drug Metab. Dispos. 21:50-5; Burchell (1995) Life Sci. 57:1819-31). In contrast,
UGT1*6 exhibits only limited substrate specificity for planar phenolic compounds relative to
other human UGTs.

Polymorphisms can markedly affect binding of the endogenous substrate, which can
be manifested as clinical syndromes. At least two conditions, Crigler-Najjar syndrome and
Gilbert syndrome, are associated with UGT1 polymorphisms. Both of these syndromes are
hereditary and are associated with predominantly unconjugated hyperbilirubinemia.
Crigler-Najjar syndrome is associated with intense, persistent jaundice which begins at birth.
Some affected infants die in the first weeks or months of life with kernicterus; others survive
with little or no neurologic defect. Crigler-Najjar syndrome is caused by a defect in the
ability of UGT1 to catalyze UDP-glucuronidation of bilirubin, resulting in accumulation of
bilirubin in the blood (Erps et al. (1994) J. Clin. Invest. 93:564-70). Gilbert syndrome is a
benign mild form of unconjugated hyperbilirubinemia that is characterized by normal liver
function tests, normal liver histology, delayed clearance of bilirubin from the blood, and mild
jaundice that tends to fluctuate in severity. As with Crigler-Najjar syndrome, Gilbert
syndrome is associated with a defect in UGT1. Specific UGT polymorphisms that are
known to be associated with disease are indicated in Fig. 1.

Alteration of the expression or function of UGTs may also affect drug metabolism. 
For example, there may be common polymorphisms in the human UGT1 gene that alter 
expression or function of the protein product and cause drug exposure-related phenotypes. 
Thus, there is a need in the field to identify UGT1 polymorphisms in order to provide a
better understanding of drug metabolism and the diagnosis of drug exposure-related 
phenotypes. 

Relevant Literature

Genbank accession number M84122 provides UGT1 exon 2, M84123 provides
exons 3 and 4, M84124 provides exon 5, M84125 provides exon 1A, M84127 provides exon 1C,
M84128 provides exon 1D, M84129 provides exon 1E, M84130 provides exon 1F, U39570
provides exon 1G, U42604 provides exon 1H, and U39550 provides exon 1J.

The UGT gene superfamily and recommended nomenclature for describing UGT 
genes and alleles are reviewed in Mackenzie et al. (1997) Pharmacogenetics 7:255-69.

The two UGT1A6 genetic polymorphisms are described in Ciotti et al. (1997) Am. J.
Hum. Genet. 61(Supp):A249. The identification of Asp446 as a critical residue in UGT1 is
described in Iwano et al. (1997) Biochem. J. 325:587-91.

A review of the substrate specificity of human UDP-glucuronosyltransferases is
provided by Burchell et al. (1995) Life Sci. 57:1819-31. For a review of drug
glucuronidation in humans, see Miners et al. (1991) Pharmacol. Ther. 51:347-69.

At least twelve UGT1A1 polymorphisms have been identified and linked to disease.
These UGT1A1 alleles, each described in OMIM Entry 191740 (at
http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim?191740) and in OMIM Entry 143500
(at http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim?143500), include:

1) UGT1*FB (UGT1A1, 13-BP DEL, EX2; 191740.0001), which contains a 13 bp
deletion in exon 2 and is associated with Crigler-Najjar syndrome type I (CN-I);

2) UGT1A1, EXON4, C-T, SER-PHE (191740.0002), which contains a C-to-T
transition in exon 4 (resulting in an amino acid change from serine to phenylalanine) and is
associated with CN-I and deficiency of both bilirubin-UGT and phenol-UGT activities in the
liver;

3) UGT1A1, GLN331TER (191740.0003), which contains a C-to-T transition 6 bp
upstream from the 3-prime end of exon 2 of the common region (replacement of a
glutamine codon with a stop codon) and is associated with CN-I;

4) UGT1A1, ARG341TER (191740.0004), which contains a nonsense mutation
(CGA-to-TGA) in exon 3 and is associated with CN-I and a total absence of all
phenol/bilirubin UGT proteins and their activities in liver homogenate by enzymologic and
immunochemical analysis;

5) UGT1A1, GLN331ARG (191740.0005), which contains an A-to-G transition 5 bp
upstream of the exon 2/intron 2 boundary (resulting in a glutamine-to-arginine substitution)
and is associated with Crigler-Najjar syndrome, type II (CN-II);

6) UGT1A1, PHE170DEL (191740.0006), which contains a deletion of the
phenylalanine codon at position 170 in exon 1 and is associated with CN-I;

7) UGT1A1, SER376PHE (191740.0007), which contains a C-to-T transition in
codon 376 (resulting in a change of serine to phenylalanine) and is associated with CN-I;

8) UGT1A1, GLY309GLU (191740.0008), which contains a G-to-A transition in
codon 309 (resulting in a glycine to glutamic acid change) and is associated with CN-I;

9) UGT1A1, NT840, C-A, CYS-TER (191740.0009), which contains a C-to-A
transversion at base position 840 in exon 1 (resulting in replacing a cysteine with a stop
codon) and is associated with CN-I;

10) UGT1A1, PRO229GLN (191740.00010), which contains a C-to-A transversion at
nucleotide 686 (changing proline-229 to glutamine) and is associated with Gilbert syndrome;

11) UGT1A1, 2-BP INS, TA INS, TATAA ELEMENT (191740.00011), which contains 2
extra bases (TA) in the TATAA element of the 5-prime promoter region of the gene (where
normally an A(TA)6TAA element is present between nucleotides -23 and -3) and is
associated with Gilbert syndrome; and

12) UGT1A1, 1-BP INS, 470T INS (191740.00012), which contains a 470insT
mutation in exon 1 and is associated with CN-I.
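
Two of the variant types listed above lend themselves to a short worked example: the CGA-to-TGA nonsense change of item 4 and the extra TA repeat in the promoter element of item 11. The following Python sketch is offered only as an illustration. The function names are hypothetical, the codon lookup is the standard genetic code, and the promoter strings are built directly from the A(TA)nTAA pattern rather than taken from any reference sequence.

```python
import re

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_codon(codon):
    """Return the one-letter amino acid for a DNA codon ('*' marks a stop)."""
    return CODON_TABLE[codon.upper()]


def count_ta_repeats(promoter):
    """Count the TA repeats in the first A(TA)nTAA element found, or 0 if none."""
    match = re.search(r"A((?:TA)+)TAA", promoter.upper())
    return len(match.group(1)) // 2 if match else 0


if __name__ == "__main__":
    # Item 4: CGA (arginine) becomes TGA (stop).
    print(translate_codon("CGA"), "->", translate_codon("TGA"))

    # Item 11: usual element A(TA)6TAA versus the element with one extra TA.
    usual = "A" + "TA" * 6 + "TAA"
    inserted = "A" + "TA" * 7 + "TAA"
    print(count_ta_repeats(usual), count_ta_repeats(inserted))
```

A real assay would locate the element by its genomic coordinates rather than by pattern search; the regular expression here only serves to make the repeat count explicit.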

Summary of the Invention 

Genetic sequence polymorphisms are identified in the UGT1 gene. Nucleic acids
comprising the polymorphic sequences are used in screening assays, and for genotyping
individuals. The genotyping information is used to predict an individual's rate of metabolism
for UGT1 substrates, potential drug-drug interactions, and adverse/side effects.

Accordingly, in one aspect the invention features an isolated nucleic acid molecule
comprising a UGT1 sequence polymorphism of SEQ ID NOS:87-124, as part of other than
a naturally occurring chromosome. In related aspects, the invention features nucleic acid
probes for detection of UGT1 locus polymorphisms, where the probe comprises a
polymorphic sequence of SEQ ID NOS:87-124.
In another aspect the invention features an array of oligonucleotides comprising two
or more probes for detection of UGT1 locus polymorphisms, where the probes comprise at
least one form of a polymorphic sequence of SEQ ID NOS:87-124.

In still another aspect, the invention features a method for detecting in an individual
a polymorphism in UGT1 metabolism of a substrate, where the method comprises
analyzing the genome of the individual for the presence of at least one UGT1 polymorphism
of SEQ ID NOS:87-124; wherein the presence of the predisposing polymorphism is
indicative of an alteration in UGT1 expression or activity.

In one embodiment, the analyzing step of the method is accomplished by detection
of specific binding between the individual's genomic DNA with an array of oligonucleotides
comprising two or more probes for detection of UGT1 locus polymorphisms, where the
probes comprise at least one form of a polymorphic sequence of SEQ ID NOS:87-124.

In other embodiments of the method, the alteration in UGT1 expression or activity is
tissue specific, or is in response to a UGT1 modifier. The UGT1 modifier may either induce
or inhibit UGT1 expression.



Brief Description of the Drawings 
Fig. 1 is a schematic showing the UGT1 locus. Each of the first exons is denoted by
both its alphabetic and numerical nomenclatures (e.g., 1A and 1.1).

Fig. 2 is a schematic showing exons 1A-1J of the UGT1 locus and the
polymorphisms described in the present application.

Fig. 3 is a schematic showing the exons 1A-1F, and 2-5 of the UGT1 locus and the
polymorphisms that have been publicly disclosed.

Brief Description of the Sequence Listing

UGT1 Reference Sequences. SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, and 15 are the
UGT1 reference polynucleotide sequences for UGT1 exons 1A, 1C, 1D, 1E, 1F, 1G, 1H,
and 1J. The polypeptide sequences encoded by these reference exon sequences are
SEQ ID NOS:2, 4, 6, 8, 12, 14, and 16. SEQ ID NOS:17 and 18 are the reference
polynucleotide and amino acid sequences for UGT1 exons 2-5.

PCR Primers. The primary and secondary PCR primers for amplification of
polymorphic sequences are presented as SEQ ID NOS:19-50.

Sequencing Primers. The primers used in sequencing isolated polymorphic
sequences are presented as SEQ ID NOS:51-86.

Polymorphisms. Polymorphic sequences of the invention are presented as SEQ ID
NOS:88-124.



Description of the Specific Embodiments

Pharmacogenetics is the linkage between an individual's genotype and that
individual's ability to metabolize or react to a therapeutic agent. Differences in metabolism
or target sensitivity can lead to severe toxicity or therapeutic failure by altering the relation
between bioactive dose and blood concentration of the drug. Relationships between
polymorphisms in metabolic enzymes or drug targets and both response and toxicity can be
used to optimize therapeutic dose administration.

Genetic polymorphisms are identified in the UGT1 gene. Nucleic acids comprising
the polymorphic sequences are used to screen patients for altered metabolism for UGT1
substrates, potential drug-drug interactions, and adverse/side effects, as well as diseases
that result from environmental or occupational exposure to toxins. The nucleic acids are
used to establish animal, cell culture and in vitro cell-free models for drug metabolism.

Definitions 

It is to be understood that this invention is not limited to the particular methodology,
protocols, cell lines, animal species or genera, constructs, and reagents described, as such
may vary. It is also to be understood that the terminology used herein is for the purpose of
describing particular embodiments only, and is not intended to limit the scope of the present
invention which will be limited only by the appended claims.

As used herein the singular forms "a", "and", and "the" include plural referents
unless the context clearly dictates otherwise. Thus, for example, reference to "a construct"
includes a plurality of such constructs and reference to "the UGT1 nucleic acid" includes
reference to one or more nucleic acids and equivalents thereof known to those skilled in the
art, and so forth. All technical and scientific terms used herein have the same meaning as
commonly understood to one of ordinary skill in the art to which this invention belongs
unless clearly indicated otherwise.

UGT1 polymorphic sequences. The sequence of the UGT1 gene is known in the
art, and accessible in public databases, as cited above. This sequence is useful as a
reference for the genomic location of the human gene, and for specific coding region
sequences. As used herein, the term "UGT1 gene" is intended to refer to both the wild-type
and variant sequences, unless specifically denoted otherwise. Nucleic acids of particular
interest comprise the provided variant nucleotide sequence(s). For screening purposes,
hybridization probes may be used where both polymorphic forms are present, either in
separate reactions, or labeled such that they can be distinguished from each other. Assays
may utilize nucleic acids that hybridize to one or more of the described polymorphisms.






WO 99/57322 PCT/US99/09702

The genomic UGT1 sequence is of particular interest. A polymorphic UGT1 gene
sequence, i.e. including one or more of the provided polymorphisms, is useful for
expression studies to determine the effect of the polymorphisms on enzymatic activity. The
polymorphisms are also used as single nucleotide polymorphisms to detect genetic
association with phenotypic variation in UGT1 activity and expression.

The UGT1 exon structure is illustrated in Fig. 1. The UGT1 locus contains at least
12 promoters/first exons, which are apparently able to splice with common exons 2 through
5, producing gene products having different N-terminal halves but identical C-terminal
halves. The first exon utilized at least in part determines the substrate specificity of the
resulting UGT1 gene product. Each of the first exons in Fig. 1 is denoted by both its
alphabetic and numerical nomenclatures (e.g., 1A and 1.1). Polymorphisms in the UGT1
first exon can be associated with alteration of substrate binding specificity and/or disease.
Fig. 2 shows UGT1 exons 1A-1J and the polymorphisms described in the present
application. Fig. 3 shows UGT1 exons 1A-1F and 2-5 and the polymorphisms in these
exons that have been publicly disclosed. Polymorphisms denoted by an asterisk (*) have
been assigned the indicated "allele name" (e.g., *12). The specific associated disease is
indicated below in parentheses for several of these disease-associated polymorphisms.
Except for the "mutation" that is associated with Gilbert's (*28, which is not universally
agreed upon in the literature), all mutations in exons 1D, 1A, and 2-5 were isolated from
individuals with disease.

Fragments of the DNA sequence are obtained by chemically synthesizing
oligonucleotides in accordance with conventional methods, by restriction enzyme digestion,
by PCR amplification, etc. For the most part, DNA fragments will be of at least 15 nt,
usually at least 20 nt, often at least 50 nt. Such small DNA fragments are useful as primers
for PCR, hybridization screening, etc. Larger DNA fragments, i.e. greater than 100 nt, are
useful for production of the encoded polypeptide, promoter motifs, etc. For use in
amplification reactions, such as PCR, a pair of primers will be used. The exact composition
of primer sequences is not critical to the invention, but for most applications the primers will
hybridize to the subject sequence under stringent conditions, as known in the art.

The UGT1 nucleic acid sequences are isolated and obtained in substantial purity,
generally as other than an intact mammalian chromosome. Usually, the DNA will be
obtained substantially free of other nucleic acid sequences that do not include a UGT1
sequence or fragment thereof, generally being at least about 50%, usually at least about
90% pure, and is typically "recombinant", i.e. flanked by one or more nucleotides with
which it is not normally associated on a naturally occurring chromosome.
UGT1 polypeptides. The UGT1 genetic sequence, including polymorphisms, may
be employed for synthesis of a complete UGT1 protein, or polypeptide fragments thereof,
particularly fragments corresponding to functional domains, binding sites, etc., and
including fusions of the subject polypeptides to other proteins or parts thereof. For
expression, an expression cassette may be employed, providing for a transcriptional and
translational initiation region, which may be inducible or constitutive, where the coding
region is operably linked under the transcriptional control of the transcriptional initiation
region, and a transcriptional and translational termination region. Various transcriptional
initiation regions may be employed that are functional in the expression host. The
polypeptides may be expressed in prokaryotes or eukaryotes in accordance with
conventional ways, depending upon the purpose for expression. Small peptides can also
be synthesized in the laboratory.

Substrate. A substrate is a chemical entity that is modified by UGT1, usually under
normal physiological conditions. Although the duration of drug action tends to be shortened
by metabolic transformation, drug metabolism is not "detoxification". Frequently the
metabolic product has greater biologic activity than the drug itself. In some cases the
desirable pharmacologic actions are entirely attributable to metabolites, the administered
drugs themselves being inert. Likewise, the toxic side effects of some drugs may be due in
whole or in part to metabolic products.

Substrates can be either endogenous substrates (e.g., substrates normally found
within the natural environment of UGT1, such as bilirubin or other endobiotic
compound) or exogenous (e.g., substrates that are not normally found within the natural
environment of UGT1, such as ethinyl estradiol or other xenobiotic compound). Exemplary
UGT1 substrates (i.e., substrates of wild-type UGT1 and/or UGT1 polypeptides encoded by
UGT1 polymorphisms) include, but are not necessarily limited to, endobiotics such as
bilirubin, bilirubin monoglucuronide, bile acids, steroids, thyroxine, biogenic amines,
fat-soluble vitamins, UDPGA, 17β-estradiol, estriol, 2-hydroxy-estriol, T4, rT3, and the like; and
xenobiotics such as hydroxylated polycyclic aromatic hydrocarbons, heterocyclics,
carcinogens, plant metabolites, octyl gallate, ethinylestradiol, anthraflavic acid, quercetin,
1-naphthol, naphthylamines, 4-aminobiphenyl, benzidine, imipramine, BP-3,6-quinol,
5-hydroxy-BP, acetaminophen, vanillin, naproxen, 4-methylumbelliferone, monohalogenated
phenols, propofol, 4t-pentylphenol, 4-hydroxybiphenyl, carvacrol, emodin, galangin, bulky
phenols, carboxylic acids, 5-hydroxy 2AAF, 8-hydroxy 2AAF, and the like. Table 1 provides
a summary of the major endobiotic and xenobiotic substrates, as well as exemplary
non-substrates, of four UGT1 isoenzymes (UGT1*1 (same as UGT1A), UGT1*4 (same as
UGT1D), UGT1*6 (same as UGT1F), and UGT1*7 (same as UGT1G)) (see Burchell et al.
(1995) Life Sci. 57:1819-31).



Table 1 Substrate Specificity of Human Liver UGT1 Isoenzymes

UGT1*1
Endobiotic: Bilirubin (Km 24 µM); Bilirubin monoglucuronide; UDPGA (Km 0.41 mM); 17β-Estradiol; Estriol; 2-hydroxy-estriol; T4, rT3
Xenobiotic: Octyl gallate (Km 162 µM); Ethinylestradiol; Anthraflavic acid; Quercetin; 1-naphthol
Non-substrate: Gallic acid; T3; Menthol; Retinoic acid; Clofibrate; Morphine; Propofol; Testosterone

UGT1*4
Endobiotic: Bilirubin?
Xenobiotic: Naphthylamines; 4-aminobiphenyl; Benzidine; Imipramine
Non-substrate: Bilirubin?; Carbamazepine

UGT1*6
Endobiotic: (none listed)
Xenobiotic: 1-Naphthol; BP-3,6-quinol; 5-hydroxy-BP; Acetaminophen (Km 2 mM); Vanillin; Naproxen; 4-methylumbelliferone; Monohalogenated phenols
Non-substrate: 4-Hydroxybiphenyl; Propofol; Galangin; Emodin; Morphine; Estriol; Estradiol; AZT; Menthol

UGT1*7
Endobiotic: UDPGA (Km 0.41 mM); T4, rT3
Xenobiotic: Propofol (Km 172 µM); 4t-pentylphenol; 4-hydroxybiphenyl; Carvacrol; Emodin; Galangin; Octyl gallate (Km 158 µM); Other bulky phenols; Acetaminophen (Km 50 mM); Carboxylic acids (some); 5-hydroxy 2AAF; 8-hydroxy 2AAF
Non-substrate: Morphine; Estriol; Estradiol; AZT; Menthol; Chloramphenicol; Androsterone; T3


Modifier. A modifier is a chemical agent that modulates the action of UGT1, either
through altering its enzymatic activity (enzymatic modifier) or through modulation of
expression (expression modifier, e.g., by affecting transcription or translation). In some
cases the modifier may also be a substrate. For example, the UGT1 gene contains an
electrophile responsive element (USPN 5,589,504); thus, compounds such as metabolites
of planar aromatic compounds and phenolic antioxidants, as well as reactive oxygen
species including peroxides would be expression modifiers via their effect on the
electrophile responsive element. Endogenous and exogenous inducers that are capable of
inducing particular UGT activities include phenobarbital, dioxin, peroxisome proliferators,
rifampicin, oral contraceptive drugs, carbamazepine, cigarette smoke, cabbage, Brussels
sprouts, polycyclic/aromatic hydrocarbons, and derivatives of indole-3-carbinol (see
Burchell et al. (1995), supra; Parkinson, In: "Biotransformation of Xenobiotics," Chapter 6,
Casarett & Doull's Toxicology, 5th Ed., C. Klaassen, ed.).

Pharmacokinetic parameters. Pharmacokinetic parameters provide fundamental
data for designing safe and effective dosage regimens. A drug's volume of distribution,
clearance, and the derived parameter, half-life, are particularly important, as they determine
the degree of fluctuation between a maximum and minimum plasma concentration during a
dosage interval, the magnitude of steady state concentration and the time to reach steady
state plasma concentration upon chronic dosing. Parameters derived from in vivo drug
administration are useful in determining the clinical effect of a particular UGT1 genotype.
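
To make the relationship among these parameters concrete, the following Python sketch applies the standard one-compartment relationships: half-life equals ln 2 times volume of distribution divided by clearance, and average steady-state concentration equals dosing rate divided by clearance. The choice of a one-compartment model, the function names, and every numeric value are assumptions made for illustration only and are not data from this disclosure.

```python
import math


def half_life(volume_of_distribution_l, clearance_l_per_h):
    """Elimination half-life in hours for a one-compartment model."""
    return math.log(2) * volume_of_distribution_l / clearance_l_per_h


def average_steady_state_conc(dose_mg, interval_h, clearance_l_per_h,
                              bioavailability=1.0):
    """Average steady-state plasma concentration (mg/L) on repeated dosing."""
    return bioavailability * dose_mg / (clearance_l_per_h * interval_h)


if __name__ == "__main__":
    # Hypothetical comparison: a reference clearance versus a clearance
    # halved by a reduced-function metabolic genotype.
    for label, clearance in (("reference", 5.0), ("reduced", 2.5)):
        t_half = half_life(50.0, clearance)
        c_ss = average_steady_state_conc(100.0, 8.0, clearance)
        print(f"{label}: t1/2 = {t_half:.2f} h, Css,avg = {c_ss:.2f} mg/L")
```

Under these assumptions, halving clearance doubles both the half-life and the average steady-state concentration, which is the kind of genotype-dependent shift that in vivo parameters are meant to reveal.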
Expression assay. An assay to determine the effect of a sequence polymorphism
on UGT1 expression. Expression assays may be performed in cell-free extracts, or by
transforming cells with a suitable vector. Alterations in expression may occur in the basal
level that is expressed in one or more cell types, or in the effect that an expression modifier
has on the ability of the gene to be inhibited or induced. Expression levels of variant
alleles are compared by various methods known in the art. Methods for determining
promoter or enhancer strength include quantitation of the expressed natural protein;
insertion of the variant control element into a vector with a reporter gene such as
β-galactosidase, luciferase, chloramphenicol acetyltransferase, etc. that provides for
convenient quantitation; and the like.

Gel shift or electrophoretic mobility shift assay provides a simple and rapid method
for detecting DNA-binding proteins (Ausubel, F.M. et al. (1989) In: Current Protocols in
Molecular Biology, Vol. 2, John Wiley and Sons, New York). This method has been used
widely in the study of sequence-specific DNA-binding proteins, such as transcription
factors. The assay is based on the observation that complexes of protein and DNA migrate
through a nondenaturing polyacrylamide gel more slowly than free DNA fragments or
double-stranded oligonucleotides. The gel shift assay is performed by incubating a purified
protein, or a complex mixture of proteins (such as nuclear or cell extract preparations), with
an end-labeled DNA fragment containing the putative protein binding site. The reaction
products are then analyzed on a nondenaturing polyacrylamide gel. The specificity of the
DNA-binding protein for the putative binding site is established by competition experiments
using DNA fragments or oligonucleotides containing a binding site for the protein of
interest, or other unrelated DNA sequences.
Expression assays can be used to detect differences in expression of 
polymorphisms with respect to tissue specificity, expression level, or expression in 
response to exposure to various substrates, and/or timing of expression during 
development. For example, since UGT1A and UGT1E are expressed in liver, UGT1A and 
UGT1E polymorphisms could be evaluated for expression in tissues other than liver, or 
expression in liver tissue relative to a reference UGT1A or UGT1E polypeptide. Similarly, 
expression of polymorphisms in UGT1F, which is normally expressed in liver, kidney and 
skin, could be assayed in each of these tissues and the relative levels of expression 
compared to a reference UGT1F polypeptide. 

Substrate screening assay. Substrate screening assays are used to determine the
metabolic activity of a UGT1 protein or peptide fragment on a substrate. Many suitable
assays are known in the art, including the use of primary or cultured cells, genetically
modified cells (e.g., where DNA encoding the UGT1 polymorphism to be studied is
introduced into the cell within an artificial construct), cell-free systems, e.g. microsomal
preparations or recombinantly produced enzymes in a suitable buffer, or in animals,
including human clinical trials (see, e.g., Burchell et al. (1995) Life Sci. 57:1819-1831,
specifically incorporated herein by reference). Where genetically modified cells are used,
since most cell lines do not express UGT1 activity (liver cell lines being the exception),
introduction of an artificial construct for expression of the UGT1 polymorphism into many
human and non-human cell lines does not require additional modification of the host to
inactivate endogenous UGT1 expression/activity. Clinical trials may monitor serum, urine,
etc. levels of the substrate or its metabolite(s).

Typically a candidate substrate is input into the assay system, and the conversion to 
a metabolite is measured over time. The choice of detection system is determined by the 
substrate and the specific assay parameters. Assays are conventionally run, and will 
include negative and positive controls, varying concentrations of substrate and enzyme, etc. 
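
One common way to summarize such concentration-dependent measurements is the Michaelis-Menten rate law, v = Vmax·[S]/(Km + [S]). The Python sketch below assumes that rate law applies; that assumption and the function name are illustrative and are not stated in this disclosure. The two Km values are the acetaminophen values listed in Table 1 for UGT1*6 (2 mM) and UGT1*7 (50 mM), and the substrate concentrations are arbitrary.

```python
def fraction_of_vmax(substrate_conc, km):
    """Return v/Vmax for a Michaelis-Menten enzyme (same units for both inputs)."""
    if substrate_conc < 0 or km <= 0:
        raise ValueError("concentration must be >= 0 and Km must be > 0")
    return substrate_conc / (km + substrate_conc)


if __name__ == "__main__":
    # Acetaminophen Km values from Table 1, in mM.
    km_by_isoenzyme = {"UGT1*6": 2.0, "UGT1*7": 50.0}
    for conc_mm in (0.5, 2.0, 10.0):  # arbitrary substrate concentrations, mM
        for name, km in km_by_isoenzyme.items():
            frac = fraction_of_vmax(conc_mm, km)
            print(f"[S] = {conc_mm} mM, {name}: v/Vmax = {frac:.3f}")
```

At a substrate concentration equal to Km the rate is half of Vmax, which is why running the assay over a range of concentrations spanning the expected Km is informative.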

Genotyping: UGT1 genotyping is performed by DNA or RNA sequence and/or
hybridization analysis of any convenient sample from a patient, e.g. biopsy material, blood
sample (serum, plasma, etc.), buccal cell sample, etc. A nucleic acid sample from an
individual is analyzed for the presence of polymorphisms in UGT1, particularly those that
affect the activity or expression of UGT1. Specific sequences of interest include any
polymorphism that leads to changes in basal expression in one or more tissues, to changes
in the modulation of UGT1 expression by modifiers, or alterations in UGT1 substrate
specificity and/or activity.
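
As a simplified illustration of hybridization-based genotype calling, the Python sketch below treats an exact substring match as a stand-in for probe hybridization under stringent conditions and calls a genotype from a pair of allele-specific probes. The probe and sample sequences are invented for the example and do not correspond to any SEQ ID NO; the function names are likewise hypothetical.

```python
def probe_hits(sample_sequences, probe):
    """True if the probe matches exactly within any sequence from the sample."""
    return any(probe in seq for seq in sample_sequences)


def call_genotype(sample_sequences, reference_probe, variant_probe):
    """Call a genotype from which allele-specific probes match the sample."""
    has_ref = probe_hits(sample_sequences, reference_probe)
    has_var = probe_hits(sample_sequences, variant_probe)
    if has_ref and has_var:
        return "heterozygous"
    if has_var:
        return "homozygous variant"
    if has_ref:
        return "homozygous reference"
    return "no call"


if __name__ == "__main__":
    # Invented 15-mer probes differing at one central base (G versus T).
    ref_probe = "GATTACAGCCTAGGT"
    var_probe = "GATTACATCCTAGGT"
    # Invented sample: one chromosome copy carries each allele.
    sample = ["CCA" + ref_probe + "TTG", "CCA" + var_probe + "TTG"]
    print(call_genotype(sample, ref_probe, var_probe))
```

Real hybridization tolerates some mismatch depending on stringency, so exact matching is only a first approximation of probe behavior.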
Linkage Analysis: Diagnostic screening may be performed for polymorphisms that
are genetically linked to a phenotypic variant in UGT1 activity or expression, particularly
through the use of microsatellite markers or single nucleotide polymorphisms (SNP). The
microsatellite or SNP polymorphism itself may not be phenotypically expressed, but is linked to
sequences that result in altered activity or expression. Two polymorphic variants may be in
linkage disequilibrium, i.e. where alleles show non-random associations between genes
even though individual loci are in Hardy-Weinberg equilibrium.
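
Linkage disequilibrium between two biallelic sites is commonly quantified by the coefficient D = p(AB) - p(A)·p(B) and by r² = D² / [p(A)(1 - p(A))·p(B)(1 - p(B))]. The Python sketch below computes both from haplotype frequencies. The allele labels (A/a and B/b), the function name, and the frequencies are hypothetical and serve only to illustrate the calculation.

```python
def linkage_disequilibrium(hap_freqs):
    """Return (D, r_squared) from haplotype frequencies.

    hap_freqs maps the two-site haplotypes 'AB', 'Ab', 'aB', 'ab' to their
    frequencies, which are expected to sum to 1.
    """
    p_hap_ab = hap_freqs.get("AB", 0.0)
    p_a = p_hap_ab + hap_freqs.get("Ab", 0.0)  # frequency of allele A
    p_b = p_hap_ab + hap_freqs.get("aB", 0.0)  # frequency of allele B
    d = p_hap_ab - p_a * p_b
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    r_squared = d * d / denom if denom > 0 else 0.0
    return d, r_squared


if __name__ == "__main__":
    # Complete association: only haplotypes AB and ab are observed.
    print(linkage_disequilibrium({"AB": 0.5, "ab": 0.5}))
    # Random association: all four haplotypes equally frequent.
    print(linkage_disequilibrium({"AB": 0.25, "Ab": 0.25, "aB": 0.25, "ab": 0.25}))
```

An r² near 1 indicates that a marker can stand in for a linked functional variant, while an r² near 0 indicates that it carries little information about it.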

Linkage analysis may be performed alone, or in combination with direct detection of
phenotypically evident polymorphisms. The use of microsatellite markers for genotyping is
well documented. For examples, see Mansfield et al. (1994) Genomics 24:225-233; and
Ziegle et al. (1992) Genomics 14:1026-1031. The use of SNPs for genotyping is illustrated
in Underhill et al. (1996) Proc Natl Acad Sci U S A 93:196-200.

Transgenic animals. The subject nucleic acids can be used to generate genetically 
modified non-human animals or site specific gene modifications in cell lines. The term 
"transgenic" is intended to encompass genetically modified animals having a deletion or 
other knock-out of UGT1 gene activity, having an exogenous UGT1 gene that is stably 
transmitted in the host cells, or having an exogenous UGT1 promoter operably linked to a 
reporter gene. Transgenic animals may be made through homologous recombination, 
where the UGT1 locus is altered. Alternatively, a nucleic acid construct is randomly 
integrated into the genome. Vectors for stable integration include plasmids, retroviruses 
and other animal viruses, YACs, and the like. Of interest are transgenic mammals, e.g. 
cows, pigs, goats, horses, etc., and particularly rodents, e.g. rats, mice, etc. 

Genetically Modified Cells. Primary or cloned cells and cell lines are modified by the 
introduction of vectors comprising UGT1 gene polymorphisms. The gene may comprise 
one or more variant sequences, preferably a haplotype of commonly occurring 
combinations. In one embodiment of the invention, a panel of two or more genetically 
modified cell lines, each cell line comprising a UGT2B4 polymorphism, is provided for 
substrate and/or expression assays. The panel may further comprise cells genetically 
modified with other genetic sequences, including polymorphisms, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, etc. 

Vectors useful for introduction of the gene include plasmids and viral vectors, e.g. 
retroviral-based vectors, adenovirus vectors, etc. that are maintained transiently or stably in 
mammalian cells. A wide variety of vectors can be employed for transfection and/or 
integration of the gene into the genome of the cells. Alternatively, micro-injection, fusion, 
or the like may be employed for introduction of genes into a suitable host cell. 



Genotyping Methods 
The effect of a polymorphism in the UGT1 gene sequence on the response to a 
particular substrate or modifier of UGT1 is determined by in vitro or in vivo assays. Such 
assays may include monitoring the metabolism of a substrate during clinical trials to 
determine the UGT1 enzymatic activity, specificity or expression level. Generally, in vitro 
assays are useful in determining the direct effect of a particular polymorphism, while clinical 
studies will also detect an enzyme phenotype that is genetically linked to a polymorphism. 

The response of an individual to the substrate or modifier can then be predicted by 
determining the UGT1 genotype with respect to the polymorphism. Where there is a 
differential distribution of a polymorphism by racial background, guidelines for drug 
administration can be generally tailored to a particular ethnic group. 

The basal expression level in different tissues may be determined by analysis of 
tissue samples from individuals typed for the presence or absence of a specific 
polymorphism. Any convenient method may be used, e.g. ELISA, RIA, etc. for protein 
quantitation; northern blot or other hybridization analysis, quantitative RT-PCR, etc. for 
mRNA quantitation. The tissue specific expression is correlated with the genotype. 

The alteration of UGT1 expression in response to a modifier is determined by 
administering or combining the candidate modifier with an expression system, e.g. animal, 
cell, in vitro transcription assay, etc. The effect of the modifier on UGT1 transcription 
and/or steady state mRNA levels is determined. As with the basal expression levels, tissue 
specific interactions are of interest. Correlations are made between the ability of an 
expression modifier to affect UGT1 activity, and the presence of the provided 
polymorphisms. A panel of different modifiers, cell types, etc. may be screened in order to 
determine the effect under a number of different conditions. 

A UGT1 polymorphism that results in altered enzyme activity or specificity is 
determined by a variety of assays known in the art. The enzyme may be tested for 
metabolism of a substrate in vitro, for example in defined buffer, or in cell or subcellular 
lysates, where the ability of a substrate to be metabolized by UGT1 under physiologic 
conditions is determined. Where there are not significant issues of toxicity from the 
substrate or metabolite(s), in vivo human trials may be utilized, as previously described. 

The genotype of an individual is determined with respect to the provided UGT1 gene 
polymorphisms. The genotype is useful for determining the presence of a phenotypically 
evident polymorphism, and for determining the linkage of a polymorphism to phenotypic 
change. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence. Where large amounts of DNA are available, genomic DNA is used 
directly. Alternatively, the region of interest is cloned into a suitable vector and grown in 
sufficient quantity for analysis. The nucleic acid may be amplified by conventional 
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts for 
analysis. The use of the polymerase chain reaction is described in Saiki et al. (1985) 
Science 230:1350-1354, and a review of current techniques may be found in Sambrook et 
al. Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp. 14.2-14.33. 
Amplification may be used to determine whether a polymorphism is present, by using a 
primer that is specific for the polymorphism. Alternatively, various methods are known in 
the art that utilize oligonucleotide ligation as a means of detecting polymorphisms; for 
examples see Riley et al. (1990) Nucleic Acids Res 18:2887-2890; and Delahunty et al. 
(1996) Am J Hum Genet 58:1239-1246. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or 
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); radioactive labels, e.g. 32P, 35S, 3H; etc. 
The label may be a two stage system, where the amplified DNA is conjugated to biotin, 
haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., 
where the binding partner is conjugated to a detectable label. The label may be conjugated 
to one or both of the primers. Alternatively, the pool of nucleotides used in the amplification 
is labeled, so as to incorporate the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or 
other methods. Hybridization with the variant sequence may also be used to determine its 
presence, by Southern blots, dot blots, etc. The hybridization pattern of a control and 
variant sequence to an array of oligonucleotide probes immobilized on a solid support, as 
described in U.S. 5,445,934, or in WO95/35505, may also be used as a means of detecting 
the presence of variant sequences. Single strand conformational polymorphism (SSCP) 
analysis, denaturing gradient gel electrophoresis (DGGE), mismatch cleavage detection, 
and heteroduplex analysis in gel matrices are used to detect conformational changes 
created by DNA sequence variation as alterations in electrophoretic mobility. Alternatively, 
where a polymorphism creates or destroys a recognition site for a restriction endonuclease 
(restriction fragment length polymorphism, RFLP), the sample is digested with that 
endonuclease, and the products size fractionated to determine whether the fragment was 
digested. Fractionation is performed by gel or capillary electrophoresis, particularly 
acrylamide or agarose gels. 
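
Whether a given substitution creates or destroys a restriction site can be checked in silico before an RFLP assay is set up. The following is a minimal sketch, assuming a palindromic recognition sequence (so that scanning one strand is sufficient); the function names and the example amplicon are hypothetical and are not taken from the UGT1 sequences.

```python
def count_sites(sequence, site):
    """Count (possibly overlapping) occurrences of a recognition site."""
    sequence, site = sequence.upper(), site.upper()
    return sum(sequence.startswith(site, i)
               for i in range(len(sequence) - len(site) + 1))


def restriction_site_change(reference, variant, site):
    """Return 'created', 'destroyed' or 'unchanged' for a palindromic site."""
    ref_n = count_sites(reference, site)
    var_n = count_sites(variant, site)
    if var_n > ref_n:
        return "created"
    if var_n < ref_n:
        return "destroyed"
    return "unchanged"


# Hypothetical amplicon in which an A>G substitution removes an EcoRI site.
reference = "ACGTGAATTCTTAG"
variant = "ACGTGAGTTCTTAG"
print(restriction_site_change(reference, variant, "GAATTC"))
```

When the site is destroyed, the variant amplicon remains undigested and migrates as a single larger fragment on the gel.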

In one embodiment of the invention, an array of oligonucleotides is provided, 
where discrete positions on the array are complementary to one or more of the provided 
polymorphic sequences, e.g. oligonucleotides of at least 12 nt, frequently 20 nt, or larger, 
and including the sequence flanking the polymorphic position. Such an array may comprise 
a series of oligonucleotides, each of which can specifically hybridize to a different 
polymorphism. For examples of arrays, see Hacia et al. (1996) Nat Genet 14:441-447 and 
DeRisi et al. (1996) Nat Genet 14:457-460. Arrays of interest may further comprise 
sequences, including polymorphisms, of other genetic sequences, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, etc. 
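
A probe pair for one array position can be sketched as the reference and variant oligonucleotides centered on the polymorphic base together with its flanking sequence. The example below is only an illustration: the function name, the choice of eight flanking bases, and the use of a short fragment read from the coding sequence of SEQ ID NO:1 (codons 68 to 76) are assumptions made for the sketch.

```python
def allele_probes(reference, index, variant_base, flank=8):
    """Return (reference_probe, variant_probe) centered on a polymorphic base.

    index is the 0-based position of the polymorphic base in `reference`;
    each probe spans `flank` bases on either side (2 * flank + 1 nt in total).
    """
    if index - flank < 0 or index + flank >= len(reference):
        raise ValueError("not enough flanking sequence for the requested probe")
    ref_probe = reference[index - flank:index + flank + 1].upper()
    var_probe = ref_probe[:flank] + variant_base.upper() + ref_probe[flank + 1:]
    return ref_probe, var_probe


# Fragment of SEQ ID NO:1 (codons 68-76); index 9 is the first base of codon 71.
fragment = "atcagagacggagcattttacaccttg"
ref_probe, var_probe = allele_probes(fragment, index=9, variant_base="a")
print(ref_probe)
print(var_probe)
```

Placing the polymorphic base at the center of the probe is a common design choice because a central mismatch tends to destabilize hybridization more than one near an end.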

The genotype information is used to predict the response of the individual to a 
particular UGT1 substrate or modifier. Where an expression modifier inhibits UGT1 
expression, then drugs that are a UGT1 substrate will be metabolized more slowly if the 
modifier is co-administered. Where an expression modifier induces UGT1 expression, a 
co-administered substrate will typically be metabolized more rapidly. Similarly, changes in 
UGT1 activity will affect the metabolism of an administered drug. The pharmacokinetic 
effect of the interaction will depend on the metabolite that is produced, e.g. a prodrug is 
metabolized to an active form, a drug is metabolized to an inactive form, an environmental 
compound is metabolized to a toxin, etc. Consideration is given to the route of 
administration, drug-drug interactions, drug dosage, etc. 

Examples 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is regarded as the invention. 
Efforts have been made to ensure accuracy with respect to the numbers used (e.g., 
amounts, temperature, concentrations, etc.) but some experimental errors and deviations 
should be allowed for. Unless otherwise indicated, parts are parts by weight, molecular 
weight is average molecular weight, temperature is in degrees centigrade; and pressure is 
at or near atmospheric. 

Example: Identification of UGT1 Polymorphisms 
MATERIALS AND METHODS 

DNA Samples. Blood specimens were collected from approximately 48 individuals 
after obtaining informed consent. All samples were stripped of personal identifiers to 
maintain confidentiality. Genomic DNA was isolated from these samples using standard 
techniques. Genomic DNA was stored either as a concentrated solution, or in a dried form 
in microtiter plates. 

PCR amplifications. The primers used to amplify all exons are shown in Table 2, and 
were designed with NBI's Oligo version 5.0 program. 

Table 2. PCR Primers. (Ex = Exon) 

PRIMARY PCR AMPLIFICATION 

EX  FORWARD PRIMER                               REVERSE PRIMER 
1A  TGGTGTATCG ATTGGTmr (SEQ ID NO:19)           CATATATCTGGGGCTAGTTAATC (SEQ ID NO:20) 
1C  ACAAGGTAATTAAGATGAAGAAAGCA (SEQ ID NO:21)    ACCTGAGATAGTGGCTTCCT (SEQ ID NO:22) 
1D  TTTGTCTTCCAATTACATGC (SEQ ID NO:23)          AGTAGATATGGAAGCACTTGTAAG (SEQ ID NO:24) 
1E  TCTCAGTGACAAGGTAATTAAGAC (SEQ ID NO:25)      CATTGATTGGATAAAGGCA (SEQ ID NO:26) 
1F  AATTTGGGTTCTTACATATCAA (SEQ ID NO:27)        GAGTGAGGGAGGACAGAG (SEQ ID NO:28) 
1G  ATAAGTACACGCCTTCTTTTG (SEQ ID NO:29)         GCTGCTTTATACAATTTGCTAC (SEQ ID NO:30) 
1H  CGCCTACGTATCATAGCAGTrA (SEQ ID NO:31)        GGAAAGAAATTTGAAATGCAAC (SEQ ID NO:32) 
1J  TCTTTCCGCCTACTGTATCA (SEQ ID NO:33)          TTCAAGAAGGGCAGTTTTAT (SEQ ID NO:34) 

SECONDARY PCR AMPLIFICATION 

EX  FORWARD PRIMER                               REVERSE PRIMER 
1A  CTCTGGCAGGAGCAAAG (SEQ ID NO:35)             ATACACACCTGGGATAGTGG (SEQ ID NO:36) 
1C  GGTAATTAAGATGAAGAAAGCA (SEQ ID NO:37)        CTGAGATAGTGGCTTCCTG (SEQ ID NO:38) 
1D  GTGGCTCAATGACAAGG (SEQ ID NO:39)             ATATGGAAGCACTTGTAAGTAAA (SEQ ID NO:40) 
1E  TTAAGACGAAGGAAACAATTCT (SEQ ID NO:41)        ACCTGAGATAGTGGCTTCC (SEQ ID NO:42) 
1F  ATCAAAGGGTAAAATTCAGA (SEQ ID NO:43)          GGCAGTCCAAAAGAAATA (SEQ ID NO:44) 
1G  TTTTGAGGGCAGGTTCTA (SEQ ID NO:45)            AATGGGACAAATGTAAATGATA (SEQ ID NO:46) 
1H  TTCTCTCATGGCTCGCA (SEQ ID NO:47)             ATGTCAAATCACAATTCAGTAAGG (SEQ ID NO:48) 
1J  CCGCCTACTGTATCATAGCA (SEQ ID NO:49)          CAACGAAATGTCAAATCACAG (SEQ ID NO:50) 

Publicly available genomic sequences were used as references. Twenty-five 
nanograms of genomic DNA were amplified in the primary amplifications using the Perkin 
Elmer GeneAmp PCR kit according to the manufacturer's instructions in 25 µl reactions with 
AmpliTaq Gold DNA polymerase. Reactions contained 25 mM MgCl2 and 0.2 µM of each 
primer. Thermal cycling was performed using a GeneAmp PCR System 9600 PCR machine 
(Perkin Elmer), utilizing a touch-down PCR protocol. The protocol, unless indicated otherwise 
in Table 3, consisted of an initial incubation of 95°C for 10 min, followed by eight cycles of 
95°C for 20 sec, 66°C (minus 1°C per cycle) for 15 sec, 72°C for 2 min, and twenty-seven 
cycles of 95°C for 20 sec, 54°C for 15 sec, 72°C for 2 min, and one final extension step of 
72°C for 10 min. 

For the secondary PCR reactions, one microliter of each primary PCR reaction was 
re-amplified using the secondary PCR primers, also listed in Table 2. The thermal cycling profile 
that was used for the primary PCR for an exon was used for the secondary PCR. 
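
The annealing temperatures of the general touch-down profile described above can be laid out cycle by cycle with a short helper. This is a sketch only: the function name is hypothetical, the default values are those read from the protocol in the text (eight touch-down cycles starting at 66°C, then twenty-seven cycles at 54°C), and the sketch assumes that the first touch-down cycle anneals at the starting temperature, which the text does not state explicitly.

```python
def touchdown_schedule(start_anneal=66, touchdown_cycles=8, step=1,
                       final_anneal=54, final_cycles=27):
    """Return the annealing temperature (degrees C) used in each PCR cycle."""
    # Touch-down phase: lower the annealing temperature by `step` each cycle.
    temps = [start_anneal - step * i for i in range(touchdown_cycles)]
    # Fixed phase: remaining cycles at a constant annealing temperature.
    temps += [final_anneal] * final_cycles
    return temps


schedule = touchdown_schedule()
print(len(schedule), "cycles")
print("touch-down phase:", schedule[:8])
```

The exon-specific modifications can be expressed by changing the arguments, e.g. `touchdown_schedule(start_anneal=64, touchdown_cycles=10, final_cycles=25)` for a ten-cycle touch-down step starting at 64°C with 35 cycles in total.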



Table 3. Cycling Profile Modifications 

Exon  Primary PCR                                             Secondary PCR 
1E    Touch-Down PCR step: 8 cycles,                          same as Primary PCR 
      64°C (minus 1°C per cycle), for 15 sec 
      Total Number of cycles: 35 
1F    Touch-Down PCR step: 10 cycles,                         same as Primary PCR 
      64°C (minus 1°C per cycle), for 15 sec 
      Total Number of cycles: 35 
1G    Touch-Down PCR step: 7 cycles,                          same as Primary PCR 
      64°C (minus 1°C per cycle), for 15 sec 
      Total Number of cycles: 35 
1H    Touch-Down PCR step: 10 cycles,                         same as Primary PCR 
      66°C (minus 1°C per cycle), for 15 sec 
      Total Number of cycles: 35 



DNA sequencing. PCR products from 48 individuals, approximately one-third 
representing each of the 3 major racial groups (see above), were prepared for sequencing 
by treating 8 µL of each PCR product with 0.15 µL of exonuclease I (1.5 U/reaction), 0.3 µL 
of Shrimp Alkaline Phosphatase (0.3 U/reaction), q.s. to 10 µL with MilliQ water, and 
incubating at 37°C for 15 min, followed by 72°C for 15 min. Cycle sequencing was 
performed on the GeneAmp PCR System 9600 PCR machine (Perkin Elmer) using the ABI 
Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit according to the 
manufacturer's directions, with the following changes: (1) 2 µL of dRhodamine terminator 
premix, instead of 8 µL; and (2) 10% (v/v) dimethylsulfoxide was added to each individual 
nucleotide. The oligonucleotide primers (unlabeled), at 3 picomoles per reaction, used for 
the sequencing reactions are listed in Table 4. Sequencing reactions, with a final volume of 
5 µL, were subjected to 30 cycles at 96°C for 20 sec, 50°C for 5 sec, and 60°C for 4 min, 
followed by ethanol precipitation. After decanting the ethanol, samples were evaporated to 
dryness using a SpeedVac for roughly 15 min and were resuspended in 2 µl of loading 
buffer (6:1 deionized formamide:50 mM EDTA pH 8.0). The samples were then heated to 
94°C for 2 min, and electrophoresed through 5.25% polyacrylamide/6M urea gels in an ABI 
Prism 377 DNA Sequencer according to the manufacturer's instructions for sequence 
determination. All sequences were determined from both the 5' and 3' (sense and 
antisense) direction. 

Among the forty-eight samples, 38 polymorphisms were identified. The polymorphisms 
are described in Table 5 below. 

Table 4. Sequencing Primers (No. = Polymorphism No.) 

No.    FORWARD PRIMER                              REVERSE PRIMER 
1      CTCTGGCAGGAGCAAAG (SEQ ID NO:51)            ACAGTGGGCAGAGACAG (SEQ ID NO:52) 
2      GTGGTTTATTCCCCGTAT (SEQ ID NO:53)           ATACACACCTGGGATAGTGG (SEQ ID NO:54) 
3-5    GGTAATTAAGATGAAGAAAGCA (SEQ ID NO:55)       GAAATGGCATAGGTTGTC (SEQ ID NO:56) 
6      GGCCACACTCAACTGTA (SEQ ID NO:57)            CTCAAAAAAAACACAGTAGG (SEQ ID NO:58) 
7,8    ACTTTTTCTGCCCCTTAT (SEQ ID NO:59)           ATATGGAAGCACTTGTAAGTAAA (SEQ ID NO:60) 
9-12   TTAAGACGAAGGAAACAATTCT (SEQ ID NO:61)       AATGGCATACGTTGTCA (SEQ ID NO:62) 
13,14  AGAATGGCAATTATGAACA (SEQ ID NO:63)          TGTGTGCCCTTAAAGTCT (SEQ ID NO:64) 
15-17  AGAATGGCAATTATGAACA (SEQ ID NO:65)          ACCTGAGATAGTGGCTTCC (SEQ ID NO:66) 
18-24  CTCTGGC ICTGTCCTAC* (SEQ ID NO:67)          ACCTGAGATAGTGGCTTCC (SEQ ID NO:68) 
25     ATCAAAGGGTAAAATTCAGA (SEQ ID NO:69)         CAGCAGCTTGTCACCTAC (SEQ ID NO:70) 
26     AATTTGCTTTTGAAAGAATC (SEQ ID NO:71)         GGTAGGCCCAAATACTCA (SEQ ID NO:72) 
27,28  AATTTGCmTGAAAGAATC (SEQ ID NO:73)           GGCAGTCCAAAAGAAATA (SEQ ID NO:74) 
29,30  TTTTGAGGGCAGGTTCTA (SEQ ID NO:75)           CACCTCTGGCATGACTAC (SEQ ID NO:76) 
31,32  rrGCAGGAGTTTGTTTAAT (SEQ ID NO:77)          AATGGGACAAATGTAAATGATA (SEQ ID NO:78) 
33     CATTGCAGGAGTTTGTTTA (SEQ ID NO:79)          CATCTGAGAACCCTAAGAGA (SEQ ID NO:80) 
34     AGAAATAGCCTCTGAAATTC (SEQ ID NO:81)         ATGTCAAATCACAATTCAGTAAGG (SEQ ID NO:82) 
35     CCGCCTACTGTATCATAGCA (SEQ ID NO:83)         GAGTGTACGAGGTTGAGTAAG (SEQ ID NO:84) 
36-38  ATTTTGCCAGTATCl 1 1 1 1 AG (SEQ ID NO:85)   CAACGAAATGTCAAATCACAG (SEQ ID NO:86) 

* Note polymorphism in primer. The reference sequence has a "C" at the highlighted 
position. 

Table 5. UGT1 polymorphisms. Amino acid changes numbered from first methionine for 
that exon (Ex). 

No  Ex  Ntd     AA            SEQUENCE (SEQ ID NO:) 
1   1A  G227A   Gly 71 Arg    CATCAGAGAC A GAGCATTTTACACCTT (SEQ ID NO:87) 
2   1A  T765C   Ser 251 Pro   GGACCTATTGAGC CCTGCATCTGTCT (SEQ ID NO:88) 
3   1C  T75C    Trp 11 Arg    GGTTCCCCTGCCG C GGCTGGCCACA (SEQ ID NO:89) 
4   1C  G125A                 GCCCTGGGCTGA A AGTGGAAAG (SEQ ID NO:90) 
5   1C  T184C   Val 47 Ala    ATGCGGGAGG C CTTGCGGGAGCT (SEQ ID NO:91) 
6   1C  A521G                 CTCTGCGCGGC G GTGCTGGCTAAG (SEQ ID NO:92) 
7   1D  G848A                 TACCCCAGGCC A ATCATGCCCAACA (SEQ ID NO:93) 
8   1D  C43T    Intronic      TCCAGGCAAAA XACTTTTTAAAAAATG (SEQ ID NO:94) 
9   1E  T187C   Leu 48 Ser    AGCATGCGGGAGGCCT C GCGGGA (SEQ ID NO:95) 
10  1E  C194G   Asp 58 Glu    GCGGGA G CTCCATGCGAGAGG (SEQ ID NO:96) 
11  1E  T232C   Leu 63 Pro    TGGTGGTCCTCACCC C GGAGGTGAA (SEQ ID NO:97) 
12  1E  A257G                 TACATCAAAGA G GAGAACTTTTTCAC (SEQ ID NO:98) 
13  1E  C468A   His 142 Asn   TGATCAGGCACCTG A ATGCTACTTCC (SEQ ID NO:99) 
14  1E  C517G   Ala 158 Gly   ACCTCTGCG G GGCGGTGCTGG (SEQ ID NO:100) 
15  1E  C689T                 AAGAACATGCT XTACCCTCTGGC (SEQ ID NO:101) 
16  1E  C701T                 CTCTGGCICTGTCCTACC (SEQ ID NO:102) 
17  1E  C717T                 TCCTACCTTTGC T ATGCTGnTCT (SEQ ID NO:103) 
18  1E  C786A   Leu 248 Ile   TGTCAGTGGTGGAT A TT (SEQ ID NO:104) 
19  1E  G789C   Val 249 Leu   GGTGGAT A TTCITCAGC (SEQ ID NO:105) 
20  1E  C795T   His 251 Tyr   TCAGC 1 ATGCATC (SEQ ID NO:106) 
21  1E  T803C   Ser 253 Phe   GCATC £GTGTGGCTGTTCCGA (SEQ ID NO:107) 
22  1E  G819C   Gly 259 Arg   TGGCTGTTCCGA CGGGACTT (SEQ ID NO:108) 
23  1E  T827C                 GGGACTTC GTGATGGA (SEQ ID NO:109) 
24  1E  T836C                 GTGATGGA C TACCCCAGGCCGAT (SEQ ID NO:110) 
25  1F  T161G   Ser 7 Ala     CCTGCCTCCTTCGC G CATTTCAGAG (SEQ ID NO:111) 
26  1F  A457G                 GCGATCATTCCT G ACTGCTCCTCAG (SEQ ID NO:112) 
27  1F  A683G   Thr 181 Ala   CCCTGGAGCAT G CATTCAGCAG (SEQ ID NO:113) 
28  1F  A694C   Arg 184 Ser   CATTCAGCAG aAGCCCAGACCCT (SEQ ID NO:114) 
29  1G  T35G                  TACTTCITCCAC aTACTATATTA (SEQ ID NO:115) 
30  1G  C124A                 GGCCTCCTTCC A CTATATGTGTGT (SEQ ID NO:116) 
31  1G  T712C   Trp 208 Arg   GGAGAGAGTA aCGAACCACAT (SEQ ID NO:117) 
32  1G  G846A                 TCAATTTGGTT A TTGCGAACTGA (SEQ ID NO:118) 
33  1H  G518C   Gly 173 Ala   CAGGGGAATAG C TTGCCACTAT (SEQ ID NO:119) 
34  1H  A765G                 TGTTGCGAAC G GACTTTGTTTTGG (SEQ ID NO:120) 
35  1J  G127A                 TTCACCAGCA A TCGGTGGTGG (SEQ ID NO:121) 
36  1J  C694T                 CTAGAAATAGC ITCTGAAATTCTCC (SEQ ID NO:122) 
37  1J  C731A   Leu 244 Ile   CGGCATATGAT A TCTACAGTCACA (SEQ ID NO:123) 
38  1J  T761C   Arg 254 Stop  TCAATTTGGTTG C TGCGAACAGGAC (SEQ ID NO:124) 

The asterisk associated with the second nucleotide residue in polymorphism no. 19 
is in the sequence surrounding the newly discovered polymorphism at residue 789 
(nucleotide change from C at residue 786 to A). 
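
The amino acid consequence of a coding substitution such as those listed in Table 5 can be checked by translating the reference and variant codons with the standard genetic code. The sketch below is illustrative only; the function name is hypothetical, and the example uses codons 69 to 72 of SEQ ID NO:1 (aga gac gga gca) as read from the sequence listing, with the first base of the third codon of the fragment (codon 71, gga) changed to A.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def codon_effect(cds, codon_number, pos_in_codon, new_base):
    """Return (reference_aa, variant_aa) as one-letter codes.

    codon_number is 1-based within `cds`; pos_in_codon is 0, 1 or 2.
    """
    start = 3 * (codon_number - 1)
    ref_codon = cds[start:start + 3].upper()
    var_codon = (ref_codon[:pos_in_codon] + new_base.upper()
                 + ref_codon[pos_in_codon + 1:])
    return CODON_TABLE[ref_codon], CODON_TABLE[var_codon]


# Codons 69-72 of SEQ ID NO:1; the third codon of the fragment is codon 71.
fragment = "agagacggagca"
print(codon_effect(fragment, codon_number=3, pos_in_codon=0, new_base="a"))
```

For this fragment the reference codon encodes glycine and the variant codon encodes arginine, in agreement with the Gly 71 Arg entry for polymorphism no. 1.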

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spirit or scope 
of the appended claims. 

What is Claimed is: 



PCT/US99/09702 



1. An isolated nucleic acid molecule comprising a UGT1 sequence 
polymorphism of SEQ ID NOS:87-124, as part of other than a naturally occurring 
chromosome. 

2. A nucleic acid probe for detection of UGT1 locus polymorphisms, comprising 
a polymorphic sequence of SEQ ID NOS:87-124. 

3. A nucleic acid probe according to Claim 2, wherein said probe is conjugated 
to a detectable marker. 

4. An array of oligonucleotides comprising: 
two or more probes for detection of UGT1 locus polymorphisms, said probes 
comprising at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

5. A method for detecting in an individual a polymorphism in UGT1 metabolism 
of a substrate, the method comprising: 
analyzing the genome of said individual for the presence of at least one UGT1 
polymorphism of SEQ ID NOS:87-124; wherein the presence of said predisposing 
polymorphism is indicative of an alteration in UGT1 expression or activity. 

6. A method according to Claim 5, wherein said analyzing step comprises 
detection of specific binding between the genomic DNA of said individual with an array of 
oligonucleotides comprising: 
two or more probes for detection of UGT1 locus polymorphisms, said probes 
comprising at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

7. A method according to Claim 5, wherein said alteration in UGT1 expression 
is tissue specific. 

8. A method according to Claim 5, wherein said alteration in UGT1 expression 
is in response to a UGT1 modifier. 

9. A method according to Claim 8, wherein said modifier induces UGT1 
expression. 

10. A method according to Claim 8, wherein said modifier inhibits UGT1 
expression. 

SEQUENCE LISTING 

<110> Penny, Laura 
Galvin, Margaret 

<120> Genotyping the Human 
UDP-Glucuronosyltransferase 1 (UGT1) Gene 

<130> SEQ-17W0 

<140> Unassigned 
<141> 1999-05-04 

<150> 60/084,807 
<151> 1998-05-07 

<160> 124 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 864 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(864) 

<400> 1 

atg gct gtg gag tcc cag ggc gga cgc cca ctt gtc ctg ggc ctg ctg 48 
Met Ala Val Glu Ser Gln Gly Gly Arg Pro Leu Val Leu Gly Leu Leu 
1 5 10 15 

ctg tgt gtg ctg ggc cca gtg gtg tcc cat gct ggg aag ata ctg ttg 96 
Leu Cys Val Leu Gly Pro Val Val Ser His Ala Gly Lys Ile Leu Leu 
20 25 30 

atc cca gtg gat ggc agc cac tgg ctg agc atg ctt ggg gcc atc cag 144 
Ile Pro Val Asp Gly Ser His Trp Leu Ser Met Leu Gly Ala Ile Gln 
35 40 45 

cag ctg cag cag agg gga cat gaa ata gtt gtc cta gca cct gac gcc 192 
Gln Leu Gln Gln Arg Gly His Glu Ile Val Val Leu Ala Pro Asp Ala 
50 55 60 

tcg ttg tac atc aga gac gga gca ttt tac acc ttg aag acg tac cct 240 
Ser Leu Tyr Ile Arg Asp Gly Ala Phe Tyr Thr Leu Lys Thr Tyr Pro 
65 70 75 80 

gtg cca ttc caa agg gag gat gtg aaa gag tct ttt gtt agt ctc ggg 288 
Val Pro Phe Gln Arg Glu Asp Val Lys Glu Ser Phe Val Ser Leu Gly 
85 90 95 

cat aat gtt ttt gag aat gat tct ttc ctg cag cgt gtg atc aaa aca 336 
His Asn Val Phe Glu Asn Asp Ser Phe Leu Gln Arg Val Ile Lys Thr 
100 105 110 

tac aag aaa ata aaa aag gac tct gct atg ctt ttg tct ggc tgt tcc 384 
Tyr Lys Lys Ile Lys Lys Asp Ser Ala Met Leu Leu Ser Gly Cys Ser 
115 120 125 

cac tta ctg cac aac aag gag ctc atg gcc tcc ctg gca gaa agc agc 432 
His Leu Leu His Asn Lys Glu Leu Met Ala Ser Leu Ala Glu Ser Ser 
130 135 140 

ttt gat gtc atg ctg acg gac cct ttc ctt cct tgc agc ccc atc gtg 480 
Phe Asp Val Met Leu Thr Asp Pro Phe Leu Pro Cys Ser Pro Ile Val 
145 150 155 160 

gcc cag tac ctg tct ctg ccc act gta ttc ttc ttg cat gca ctg cca 528 
Ala Gln Tyr Leu Ser Leu Pro Thr Val Phe Phe Leu His Ala Leu Pro 
165 170 175 

tgc agc ctg gaa ttt gag gct acc cag tgc ccc aac cca ttc tcc tac 576 
Cys Ser Leu Glu Phe Glu Ala Thr Gln Cys Pro Asn Pro Phe Ser Tyr 
180 185 190 

gtg ccc agg cct ctc tcc tct cat tca gat cac atg acc ttc ctg cag 624 
Val Pro Arg Pro Leu Ser Ser His Ser Asp His Met Thr Phe Leu Gln 
195 200 205 

cgg gtg aag aac atg ctc att gcc ttt tca cag aac ttt ctg tgc gac 672 
Arg Val Lys Asn Met Leu Ile Ala Phe Ser Gln Asn Phe Leu Cys Asp 
210 215 220 

gtg gtt tat tcc ccg tat gca acc ctt gcc tca gaa ttc ctt cag aga 720 
Val Val Tyr Ser Pro Tyr Ala Thr Leu Ala Ser Glu Phe Leu Gln Arg 
225 230 235 240 

gag gtg act gtc cag gac cta ttg agc tct gca tct gtc tgg ctg ttt 768 
Glu Val Thr Val Gln Asp Leu Leu Ser Ser Ala Ser Val Trp Leu Phe 
245 250 255 

aga agt gac ttt gtg aag gat tac cct agg ccc atc atg ccc aat atg 816 
Arg Ser Asp Phe Val Lys Asp Tyr Pro Arg Pro Ile Met Pro Asn Met 
260 265 270 

gtt ttt gtt ggt gga atc aac tgc ctt cac caa aat cca cta tcc cag 864 
Val Phe Val Gly Gly Ile Asn Cys Leu His Gln Asn Pro Leu Ser Gln 
275 280 285 



<210> 2 
<211> 288 
<212> PRT 

<213> Homo sapiens 
<400> 2 



Met Ala Val Glu Ser Gln Gly Gly Arg Pro Leu Val Leu Gly Leu Leu 
1 5 10 15 
Leu Cys Val Leu Gly Pro Val Val Ser His Ala Gly Lys Ile Leu Leu 
20 25 30 
Ile Pro Val Asp Gly Ser His Trp Leu Ser Met Leu Gly Ala Ile Gln 
35 40 45 
Gln Leu Gln Gln Arg Gly His Glu Ile Val Val Leu Ala Pro Asp Ala 
50 55 60 
Ser Leu Tyr Ile Arg Asp Gly Ala Phe Tyr Thr Leu Lys Thr Tyr Pro 
65 70 75 80 
Val Pro Phe Gln Arg Glu Asp Val Lys Glu Ser Phe Val Ser Leu Gly 
85 90 95 
His Asn Val Phe Glu Asn Asp Ser Phe Leu Gln Arg Val Ile Lys Thr 
100 105 110 
Tyr Lys Lys Ile Lys Lys Asp Ser Ala Met Leu Leu Ser Gly Cys Ser 
115 120 125 
His Leu Leu His Asn Lys Glu Leu Met Ala Ser Leu Ala Glu Ser Ser 
130 135 140 
Phe Asp Val Met Leu Thr Asp Pro Phe Leu Pro Cys Ser Pro Ile Val 
145 150 155 160 
Ala Gln Tyr Leu Ser Leu Pro Thr Val Phe Phe Leu His Ala Leu Pro 
165 170 175 
Cys Ser Leu Glu Phe Glu Ala Thr Gln Cys Pro Asn Pro Phe Ser Tyr 
180 185 190 
Val Pro Arg Pro Leu Ser Ser His Ser Asp His Met Thr Phe Leu Gln 
195 200 205 
Arg Val Lys Asn Met Leu Ile Ala Phe Ser Gln Asn Phe Leu Cys Asp 
210 215 220 
Val Val Tyr Ser Pro Tyr Ala Thr Leu Ala Ser Glu Phe Leu Gln Arg 
225 230 235 240 
Glu Val Thr Val Gln Asp Leu Leu Ser Ser Ala Ser Val Trp Leu Phe 
245 250 255 
Arg Ser Asp Phe Val Lys Asp Tyr Pro Arg Pro Ile Met Pro Asn Met 
260 265 270 
Val Phe Val Gly Gly Ile Asn Cys Leu His Gln Asn Pro Leu Ser Gln 
275 280 285 

<210> 3 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(867) 

<400> 3 

atg gcc aca gga ctc cag gtt ccc ctg ccg tgg ctg gcc aca gga ctg 48 
Met Ala Thr Gly Leu Gln Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 
1 5 10 15 

ctg ctt ctc ctc agt gtc cag ccc tgg gct gag agt gga aag gtg ttg 96 
Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 

gtg gtg ccc att gat ggc agc cac tgg ctc agc atg cgg gag gtc ttg 144 
Val Val Pro Ile Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 
35 40 45 

cgg gag ctc cat gcc aga ggc cac cag gca gtg gtc ctc acc cca gag 192 
Arg Glu Leu His Ala Arg Gly His Gln Ala Val Val Leu Thr Pro Glu 
50 55 60 

gtg aat atg cac atc aaa gaa gag aac ttt ttc acc ctg aca acc tat 240 
Val Asn Met His Ile Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

gcc att tcg tgg acc cag gat gaa ttt gat cgc cat gtg ctg ggc cac 288 
Ala Ile Ser Trp Thr Gln Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 

act caa ctg tac ttt gaa aca gaa cat ttt ctg aag aaa ttt ttc aga 336 
Thr Gln Leu Tyr Phe Glu Thr Glu His Phe Leu Lys Lys Phe Phe Arg 
100 105 110 

agt atg gca atg ttg aac aat atg tct ttg gtc tat cat agg tct tgt 384 
Ser Met Ala Met Leu Asn Asn Met Ser Leu Val Tyr His Arg Ser Cys 
115 120 125 

gtg gag cta cta cat aat gag gcc ctg atc agg cac ctg aat gct act 432 
Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu Asn Ala Thr 
130 135 140 

tcc ttt gat gtg gtt tta aca gac ccc gtt aac ctc tgc gcg gca gtg 480 
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Ala Ala Val 
145 150 155 160 

ctg gct aag tac ctg tcg att cct act gtg ttt ttt ttg agg aac att 528 
Leu Ala Lys Tyr Leu Ser Ile Pro Thr Val Phe Phe Leu Arg Asn Ile 
165 170 175 

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tcc tcc 576 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aga tta cta aca acc aat tca gac cac atg aca ttc atg 624 
Tyr Ile Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Met 
195 200 205 

caa agg gtc aag aac atg ctc tac cct ctg gcc ctg tcc tac att tgc 672 
Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Ile Cys 
210 215 220 

cat gct ttt tct gct cct tat gca agc ctt gcc tct gag ctt ttt cag 720 
His Ala Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln 
225 230 235 240 

aga gag gtg tca gtg gtg gat att ctc agt cat gca tct gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp Ile Leu Ser His Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gac tac ccc agg cca atc atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc atc aac tgt gcc aac agg aag cca cta tct 864 
Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Arg Lys Pro Leu Ser 
275 280 285 

cag 867 
Gln 



<210> 4 

<211> 289 

<212> PRT 

<213> Homo sapiens 



<400> 4 



Met Ala Thr Gly Leu Gln Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 
1 5 10 15 
Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 
Val Val Pro Ile Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 
35 40 45 
Arg Glu Leu His Ala Arg Gly His Gln Ala Val Val Leu Thr Pro Glu 
50 55 60 
Val Asn Met His Ile Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 
Ala Ile Ser Trp Thr Gln Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 
Thr Gln Leu Tyr Phe Glu Thr Glu His Phe Leu Lys Lys Phe Phe Arg 
100 105 110 
Ser Met Ala Met Leu Asn Asn Met Ser Leu Val Tyr His Arg Ser Cys 
115 120 125 
Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu Asn Ala Thr 
130 135 140 
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Ala Ala Val 
145 150 155 160 
Leu Ala Lys Tyr Leu Ser Ile Pro Thr Val Phe Phe Leu Arg Asn Ile 
165 170 175 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser 
180 185 190 
Tyr Ile Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Met 
195 200 205 
Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Ile Cys 
210 215 220 
His Ala Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln 
225 230 235 240 
Arg Glu Val Ser Val Val Asp Ile Leu Ser His Ala Ser Val Trp Leu 
245 250 255 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn 
260 265 270 
Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Arg Lys Pro Leu Ser 
275 280 285 
Gln 



<210> 5 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(867)

<400> 5 

atg gcc aga gga ctc cag gtt ccc ctg ccg cgg ctg gcc aca gga ctg 48
Met Ala Arg Gly Leu Gln Val Pro Leu Pro Arg Leu Ala Thr Gly Leu
1 5 10 15

ctg ctc ctc ctc agt gtc cag ccc tgg gct gag agt gga aag gtg ttg 96
Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu
20 25 30

gtg gtg ccc act gat ggc agc ccc tgg ctc agc atg cgg gag gcc ttg 144
Val Val Pro Thr Asp Gly Ser Pro Trp Leu Ser Met Arg Glu Ala Leu
35 40 45

cgg gag ctc cat gcc aga ggc cac cag gcg gtg gtc ctc acc cca gag 192
Arg Glu Leu His Ala Arg Gly His Gln Ala Val Val Leu Thr Pro Glu
50 55 60

gtg aat atg cac atc aaa gaa gag aaa ttt ttc acc ctg aca gcc tat 240
Val Asn Met His Ile Lys Glu Glu Lys Phe Phe Thr Leu Thr Ala Tyr
65 70 75 80

gct gtt cca tgg acc cag aag gaa ttt gat cgc gtt acg ctg ggc tac 288
Ala Val Pro Trp Thr Gln Lys Glu Phe Asp Arg Val Thr Leu Gly Tyr
85 90 95

act caa ggg ttc ttt gaa aca gaa cat ctt ctg aag aga tat tct aga 336
Thr Gln Gly Phe Phe Glu Thr Glu His Leu Leu Lys Arg Tyr Ser Arg
100 105 110

agt atg gca att atg aac aat gta tct ttg gcc ctt cat agg tgt tgt 384
Ser Met Ala Ile Met Asn Asn Val Ser Leu Ala Leu His Arg Cys Cys
115 120 125

gtg gag cta ctg cat aat gag gcc ctg atc agg cac ctg aat gct act 432
Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu Asn Ala Thr
130 135 140

tcc ttt gat gtg gtt tta aca gac ccc gtt aac ctc tgt ggg gcg gtg 480
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val
145 150 155 160

ctg gct aag tac ctg tcg att cct gct gtg ttt ttt tgg agg tac att 528
Leu Ala Lys Tyr Leu Ser Ile Pro Ala Val Phe Phe Trp Arg Tyr Ile
165 170 175

cca tgt gac tta gac ttt aag ggc aca cag tgt cca aat cct tcc tcc 576
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser
180 185 190

tat att cct aag tta cta acg acc aat tca gac cac atg aca ttc ctg 624
Tyr Ile Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu
195 200 205

caa agg gtc aag aac atg ctc tac cct ctg gcc ctg tcc tac att tgc 672
Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Ile Cys
210 215 220

cat act ttt tct gcc cct tat gca agt ctt gcc tct gag ctt ttt cag 720
His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln
225 230 235 240

aga gag gtg tca gtg gtg gat ctt gtc agc tat gca tcc gtg tgg ctg 768
Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu
245 250 255

ttc cga ggg gac ttt gtg atg gac tac ccc agg ccg atc atg ccc aac 816
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn
260 265 270

atg gtc ttc att ggg ggc atc aac tgt gcc aac ggg aag cca cta tct 864
Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Gly Lys Pro Leu Ser
275 280 285

cag 867
Gln



<210> 6
<211> 289
<212> PRT

<213> Homo sapiens
<400> 6

Met Ala Arg Gly Leu Gln Val Pro Leu Pro Arg Leu Ala Thr Gly Leu
1 5 10 15

Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu
20 25 30

Val Val Pro Thr Asp Gly Ser Pro Trp Leu Ser Met Arg Glu Ala Leu
35 40 45

Arg Glu Leu His Ala Arg Gly His Gln Ala Val Val Leu Thr Pro Glu
50 55 60

Val Asn Met His Ile Lys Glu Glu Lys Phe Phe Thr Leu Thr Ala Tyr
65 70 75 80

Ala Val Pro Trp Thr Gln Lys Glu Phe Asp Arg Val Thr Leu Gly Tyr
85 90 95

Thr Gln Gly Phe Phe Glu Thr Glu His Leu Leu Lys Arg Tyr Ser Arg
100 105 110

Ser Met Ala Ile Met Asn Asn Val Ser Leu Ala Leu His Arg Cys Cys
115 120 125

Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu Asn Ala Thr
130 135 140

Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val
145 150 155 160

Leu Ala Lys Tyr Leu Ser Ile Pro Ala Val Phe Phe Trp Arg Tyr Ile
165 170 175

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser
180 185 190

Tyr Ile Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu
195 200 205

Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Ile Cys
210 215 220

His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln
225 230 235 240

Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu
245 250 255

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn
260 265 270

Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Gly Lys Pro Leu Ser
275 280 285

Gln



<210> 7 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(867)



<400> 7 

atg gcc aca gga ctc cag gtt ccc ctg ccg cag ctg gcc aca gga ctg 48
Met Ala Thr Gly Leu Gln Val Pro Leu Pro Gln Leu Ala Thr Gly Leu
1 5 10 15

ctg ctt ctc ctc agt gtc cag ccc tgg gct gag agt ggg aag gtg ctg 96
Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu
20 25 30

gtg gtg ccc act gat ggc agc cac tgg ctc agc atg cgg gag gcc ttg 144
Val Val Pro Thr Asp Gly Ser His Trp Leu Ser Met Arg Glu Ala Leu
35 40 45

cgg gac ctc cat gcg aga ggc cac cag gtg gtg gtc ctc acc ctg gag 192
Arg Asp Leu His Ala Arg Gly His Gln Val Val Val Leu Thr Leu Glu
50 55 60

gtg aat atg tac atc aaa gaa gag aac ttt ttc acc ctg aca acg tat 240
Val Asn Met Tyr Ile Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr
65 70 75 80

gcc att tca tgg acc cag gac gaa ttt gat cgc ctt ttg ctg ggt cac 288
Ala Ile Ser Trp Thr Gln Asp Glu Phe Asp Arg Leu Leu Leu Gly His
85 90 95

act caa tcg ttc ttt gaa aca gaa cat ctt ctg atg aaa ttt tct aga 336
Thr Gln Ser Phe Phe Glu Thr Glu His Leu Leu Met Lys Phe Ser Arg
100 105 110

aga atg gca att atg aac aat atg tct ttg atc ata cat agg tct tgt 384
Arg Met Ala Ile Met Asn Asn Met Ser Leu Ile Ile His Arg Ser Cys
115 120 125

gtg gag cta ctg cat aat gag gcc ctg atc agg cac ctg cat gct act 432
Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu His Ala Thr
130 135 140

tcc ttt gat gtg gtt cta aca gac ccc ttt cac ctc tgc gcg gcg gtg 480
Ser Phe Asp Val Val Leu Thr Asp Pro Phe His Leu Cys Ala Ala Val
145 150 155 160

ctg gct aag tac ctg tcg att cct gct gtg ttt ttc ttg agg aac att 528
Leu Ala Lys Tyr Leu Ser Ile Pro Ala Val Phe Phe Leu Arg Asn Ile
165 170 175

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tcc tcc 576
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser
180 185 190

tat att cct aga tta cta acg acc aat tca gac cac atg aca ttc ctg 624
Tyr Ile Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu
195 200 205

caa agg gtc aag aac atg ctc tac cct ctg gcc ctg tcc tac ctt tgc 672
Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Leu Cys
210 215 220

cat gct gtt tct gct cct tat gca agc ctt gcc tct gag ctt ttt cag 720
His Ala Val Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln
225 230 235 240

aga gag gtg tca gtg gtg gat ctt gtc agc cat gca tct gtg tgg ctg 768
Arg Glu Val Ser Val Val Asp Leu Val Ser His Ala Ser Val Trp Leu
245 250 255

ttc cga ggg gac ttt gtg atg gat tac ccc agg ccg atc atg ccc aac 816
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn
260 265 270

atg gtc ttc att ggg ggc atc aac tgt gcc aac ggg aag cca cta tct 864
Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Gly Lys Pro Leu Ser
275 280 285

Hi 



<210> 8 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ala Thr Gly Leu Gln Val Pro Leu Pro Gln Leu Ala Thr Gly Leu
1 5 10 15

Leu Leu Leu Leu Ser Val Gln Pro Trp Ala Glu Ser Gly Lys Val Leu
20 25 30

Val Val Pro Thr Asp Gly Ser His Trp Leu Ser Met Arg Glu Ala Leu
35 40 45

Arg Asp Leu His Ala Arg Gly His Gln Val Val Val Leu Thr Leu Glu
50 55 60

Val Asn Met Tyr Ile Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr
65 70 75 80

Ala Ile Ser Trp Thr Gln Asp Glu Phe Asp Arg Leu Leu Leu Gly His
85 90 95

Thr Gln Ser Phe Phe Glu Thr Glu His Leu Leu Met Lys Phe Ser Arg
100 105 110

Arg Met Ala Ile Met Asn Asn Met Ser Leu Ile Ile His Arg Ser Cys
115 120 125

Val Glu Leu Leu His Asn Glu Ala Leu Ile Arg His Leu His Ala Thr
130 135 140

Ser Phe Asp Val Val Leu Thr Asp Pro Phe His Leu Cys Ala Ala Val
145 150 155 160

Leu Ala Lys Tyr Leu Ser Ile Pro Ala Val Phe Phe Leu Arg Asn Ile
165 170 175

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gln Cys Pro Asn Pro Ser Ser
180 185 190

Tyr Ile Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu
195 200 205

Gln Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Leu Cys
210 215 220

His Ala Val Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gln
225 230 235 240

Arg Glu Val Ser Val Val Asp Leu Val Ser His Ala Ser Val Trp Leu
245 250 255

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro Ile Met Pro Asn
260 265 270

Met Val Phe Ile Gly Gly Ile Asn Cys Ala Asn Gly Lys Pro Leu Ser
275 280 285

Gln



<210> 9 

<211> 861 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(861)



<400> 9 

atg gcc tgc ctc ctt cgc tca ttt cag aga att tct gca ggg gtt ttc 48
Met Ala Cys Leu Leu Arg Ser Phe Gln Arg Ile Ser Ala Gly Val Phe
1 5 10 15

ttc tta gca ctt tgg ggc atg gtt gta ggt gac aag ctg ctg gtg gtc 96
Phe Leu Ala Leu Trp Gly Met Val Val Gly Asp Lys Leu Leu Val Val
20 25 30

cct cag gac gga agc cac tgg ctt agt atg aag gat ata gtt gag gtt 144
Pro Gln Asp Gly Ser His Trp Leu Ser Met Lys Asp Ile Val Glu Val
35 40 45

ctc agt gac cgg ggt cat gag att gta gtg gtg gtg cct gaa gtt aat 192
Leu Ser Asp Arg Gly His Glu Ile Val Val Val Val Pro Glu Val Asn
50 55 60

ttg ctt ttg aaa gaa tcc aaa tac tac aca aga aaa atc tat cca gtg 240
Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys Ile Tyr Pro Val
65 70 75 80

ccg tat gac caa gaa gag ctg aag aac cgt tac caa tca ttt gga aac 288
Pro Tyr Asp Gln Glu Glu Leu Lys Asn Arg Tyr Gln Ser Phe Gly Asn
85 90 95

aat cac ttt gct gag cga tca ttc cta act gct cct cag aca gag tac 336
Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gln Thr Glu Tyr
100 105 110

agg aat aac atg att gtt att ggc ctg tac ttc atc aac tgc cag agc 384
Arg Asn Asn Met Ile Val Ile Gly Leu Tyr Phe Ile Asn Cys Gln Ser
115 120 125

ctc ctg cag gac agg gac acc ctg aac ttc ttt aag gag agc aag ttt 432
Leu Leu Gln Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe
130 135 140

gat gct ctt ttc aca gac cca gcc tta ccc tgt ggg gtg atc ctg gct 480
Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val Ile Leu Ala
145 150 155 160

gag tat ttg ggc cta cca tct gtg tac ctc ttc agg ggt ttt ccg tgt 528
Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys
165 170 175

tcc ctg gag cat aca ttc agc aga agc cca gac cct gtg tcc tac att 576
Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr Ile
180 185 190

ccc agg tgc tac aca aag ttt tca gac cac atg act ttt tcc caa cga 624
Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gln Arg
195 200 205

gtg gcc aac ttc ctt gtt aat ttg ttg gag ccc tat cta ttt tat tgt 672
Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys
210 215 220

ctg ttt tca aag tat gaa gaa ctc gca tca gct gtc ctc aag aga gat 720
Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp
225 230 235 240

gtg gat ata atc acc tta tat cag aag gtc tct gtt tgg ctg tta aga 768
Val Asp Ile Ile Thr Leu Tyr Gln Lys Val Ser Val Trp Leu Leu Arg
245 250 255

tat gac ttt gtg ctt gaa tat cct agg ccg gtc atg ccc aac atg gtc 816
Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val
260 265 270

ttc att gga ggt atc aac tgt aag aag agg aaa gac ttg tct cag 861
Phe Ile Gly Gly Ile Asn Cys Lys Lys Arg Lys Asp Leu Ser Gln
275 280 285

<210> 10
<211> 287
<212> PRT

<213> Homo sapiens
<400> 10

Met Ala Cys Leu Leu Arg Ser Phe Gln Arg Ile Ser Ala Gly Val Phe
1 5 10 15

Phe Leu Ala Leu Trp Gly Met Val Val Gly Asp Lys Leu Leu Val Val
20 25 30

Pro Gln Asp Gly Ser His Trp Leu Ser Met Lys Asp Ile Val Glu Val
35 40 45

Leu Ser Asp Arg Gly His Glu Ile Val Val Val Val Pro Glu Val Asn
50 55 60

Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys Ile Tyr Pro Val
65 70 75 80

Pro Tyr Asp Gln Glu Glu Leu Lys Asn Arg Tyr Gln Ser Phe Gly Asn
85 90 95

Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gln Thr Glu Tyr
100 105 110

Arg Asn Asn Met Ile Val Ile Gly Leu Tyr Phe Ile Asn Cys Gln Ser
115 120 125

Leu Leu Gln Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe
130 135 140

Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val Ile Leu Ala
145 150 155 160

Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys
165 170 175

Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr Ile
180 185 190

Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gln Arg
195 200 205

Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys
210 215 220

Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp
225 230 235 240

Val Asp Ile Ile Thr Leu Tyr Gln Lys Val Ser Val Trp Leu Leu Arg
245 250 255

Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val
260 265 270

Phe Ile Gly Gly Ile Asn Cys Lys Lys Arg Lys Asp Leu Ser Gln
275 280 285

<210> 11 

<211> 951

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(951)

<400> 11

atg gct cgt gca ggg tgg act ggc ctc ctt ccc cta tat gtg tgt cta 48
Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu
1 5 10 15

ctg ctg acc tgt gct ttg cca agg tca ggg aag ctg ctg gta gtg ccc 96
Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro
20 25 30

atg gat ggg agc cac tgg ttc acc atg cag tcg gtg gtg gag aaa ctc 144
Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
35 40 45

atc ctc agg ggg cat gag gtg gtc gta gtc atg cca gag gtg agt tgg 192
Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
50 55 60

caa ctg gga aga tca ctg aat tgc aca gtg aag act tac tca acc tca 240
Gln Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
65 70 75 80






WO 99/57322 PCT/US99/09702

tac act ctg gag gat cag gac cgg gag ttc atg gtt ttt gcc gat gct 288
Tyr Thr Leu Glu Asp Gln Asp Arg Glu Phe Met Val Phe Ala Asp Ala
85 90 95

cgc tgg acg gca cca ttg cga agt gca ttt tct cta tta aca agt tca 336
Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser
100 105 110

tcc aat ggt att ttt gac tta ttt ttt tca aat tgc agg agt ttg ttt 384
Ser Asn Gly Ile Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe
115 120 125

aat gac cga aaa tta gta gaa tac tta aag gag agt tgt ttt gat gca 432
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala
130 135 140

gtg ttt ctc gat cct ttt gat cgc tgt ggc tta att gtt gcc aaa tat 480
Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu Ile Val Ala Lys Tyr
145 150 155 160

ttc tcc ctc ccc tct gtg gtc ttc gcc agg gga ata ttt tgc cac tat 528
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly Ile Phe Cys His Tyr
165 170 175

ctt gaa gaa ggt gca cag tgc cct gct cct ctt tcc tat gtc ccc aga 576
Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg
180 185 190

ctt ctc tta ggg ttc tca gac gcc atg act ttc aag gag aga gta tgg 624
Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp
195 200 205

aac cac atc atg cac ttg gag gaa cat tta ttt tgc ccc tat ttt ttc 672
Asn His Ile Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe
210 215 220

aaa aat gtc tta gaa ata gcc tct gaa att ctc caa acc cct gtc acg 720
Lys Asn Val Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
225 230 235 240

gca tat gat ctc tac agc cac aca tca att tgg ttg ttg cga act gac 768
Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
245 250 255

ttt gtt ttg gag tat ccc aaa ccc gtg atg ccc aat atg atc ttc att 816
Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
260 265 270

ggt ggt atc aac tgt cat cag gga aag cca gtg cct atg gta agt tat 864
Gly Gly Ile Asn Cys His Gln Gly Lys Pro Val Pro Met Val Ser Tyr
275 280 285

ctc ccc ttt agc aca tta aga ata atc tgg ctt tgg aaa tta aaa gat 912
Leu Pro Phe Ser Thr Leu Arg Ile Ile Trp Leu Trp Lys Leu Lys Asp
290 295 300

ttc tta cag aat cat aat tta tca ttt aca ttt gtc cca 951
Phe Leu Gln Asn His Asn Leu Ser Phe Thr Phe Val Pro
305 310 315



<210> 12 
<211> 317 
<212> PRT
<213> Homo sapiens

<400> 12

Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu
1 5 10 15

Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro
20 25 30

Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
35 40 45

Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
50 55 60

Gln Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
65 70 75 80

Tyr Thr Leu Glu Asp Gln Asp Arg Glu Phe Met Val Phe Ala Asp Ala
85 90 95

Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser
100 105 110

Ser Asn Gly Ile Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe
115 120 125

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala
130 135 140

Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu Ile Val Ala Lys Tyr
145 150 155 160

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly Ile Phe Cys His Tyr
165 170 175

Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg
180 185 190

Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp
195 200 205

Asn His Ile Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe
210 215 220

Lys Asn Val Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
225 230 235 240

Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
245 250 255

Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
260 265 270

Gly Gly Ile Asn Cys His Gln Gly Lys Pro Val Pro Met Val Ser Tyr
275 280 285

Leu Pro Phe Ser Thr Leu Arg Ile Ile Trp Leu Trp Lys Leu Lys Asp
290 295 300

Phe Leu Gln Asn His Asn Leu Ser Phe Thr Phe Val Pro
305 310 315

<210> 13 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(930)

<400> 13 

atg gct cgc aca ggg tgg acc agc ccc att ccc cta tgt gtt tct ctg 48
Met Ala Arg Thr Gly Trp Thr Ser Pro Ile Pro Leu Cys Val Ser Leu
1 5 10 15

ctg ctg acc tgt ggc ttt gct gag gca ggg aag ctg ctg gta gtg ccc 96
Leu Leu Thr Cys Gly Phe Ala Glu Ala Gly Lys Leu Leu Val Val Pro
20 25 30

atg gat ggg agt cac tgg ttc acc atg cag tcg gtg gtg gag aaa ctt 144
Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
35 40 45

atc ctc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 192
Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
50 55 60

caa ctg gga aaa tca ctg aat tgc aca gtg aag act tac tca acc tca 240
Gln Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
65 70 75 80

tac act ctg gag gat ctg gac cgg gaa ttc atg gat ttc gcc gat gct 288
Tyr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala
85 90 95

caa tgg aaa gca caa gta cga agt ttg ttt tct cta ttt ctg agt tca 336
Gln Trp Lys Ala Gln Val Arg Ser Leu Phe Ser Leu Phe Leu Ser Ser
100 105 110

tcc aat ggt ttt ttt aac tta ttt ttt tcg cat tgc agg agt ttg ttt 384
Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe
115 120 125

aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gcg 432
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala
130 135 140

gtg ttt ctt gat cct ttt gat gcc tgt gcg tta att gtt gcc aaa tat 480
Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu Ile Val Ala Lys Tyr
145 150 155 160

ttc tcc ctc ccc tct gtg gtc ttc gcc agg gga ata ggt tgc cac tat 528
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly Ile Gly Cys His Tyr
165 170 175

ctt gaa gaa ggt gca cag tgc cct gct cct ctt tcc tat gtc ccc aga 576
Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg
180 185 190

att ctc tta ggg ttc tca gat gcc atg act ttc aag gag aga gta cgg 624
Ile Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg
195 200 205

aac cac atc atg cac ttg gag gaa cat tta ttt tgc cag tat ttt tcc 672
Asn His Ile Met His Leu Glu Glu His Leu Phe Cys Gln Tyr Phe Ser
210 215 220

aaa aat gcc cta gaa ata gcc tct gaa att ctc caa aca cct gtc aca 720
Lys Asn Ala Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
225 230 235 240

gca tat gat ctc tac agc cac aca tca att tgg ttg ttg cga aca gac 768
Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
245 250 255

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aat atg atc ttc att 816
Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
260 265 270

ggt ggt atc aac tgc cat cag gga aag cca ttg cct atg gta agt cac 864
Gly Gly Ile Asn Cys His Gln Gly Lys Pro Leu Pro Met Val Ser His
275 280 285

ctc tcc ttt agc aca tta gga ata atc ttg gct ttg gaa att aaa aaa 912
Leu Ser Phe Ser Thr Leu Gly Ile Ile Leu Ala Leu Glu Ile Lys Lys
290 295 300

aga ttc ctt act gaa ttg 930
Arg Phe Leu Thr Glu Leu
305 310



<210> 14 

<211> 310 

<212> PRT 

<213> Homo sapiens 



<400> 14 



Met Ala Arg Thr Gly Trp Thr Ser Pro Ile Pro Leu Cys Val Ser Leu
1 5 10 15

Leu Leu Thr Cys Gly Phe Ala Glu Ala Gly Lys Leu Leu Val Val Pro
20 25 30

Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
35 40 45

Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
50 55 60

Gln Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
65 70 75 80

Tyr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala
85 90 95

Gln Trp Lys Ala Gln Val Arg Ser Leu Phe Ser Leu Phe Leu Ser Ser
100 105 110

Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe
115 120 125

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala
130 135 140

Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu Ile Val Ala Lys Tyr
145 150 155 160

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly Ile Gly Cys His Tyr
165 170 175

Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg
180 185 190

Ile Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg
195 200 205

Asn His Ile Met His Leu Glu Glu His Leu Phe Cys Gln Tyr Phe Ser
210 215 220

Lys Asn Ala Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
225 230 235 240

Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
245 250 255

Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
260 265 270

Gly Gly Ile Asn Cys His Gln Gly Lys Pro Leu Pro Met Val Ser His
275 280 285

Leu Ser Phe Ser Thr Leu Gly Ile Ile Leu Ala Leu Glu Ile Lys Lys
290 295 300

Arg Phe Leu Thr Glu Leu
305 310

<210> 15

<211> 759 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(759)

<400> 15

atg gat ggg agt cac tgg ttc acc atg cag tcg gtg gtg gag aaa ctt 48
Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
1 5 10 15

atc ctc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 96
Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
20 25 30

caa ctg gaa aga tca ctg aat tgc aca gtg aag act tac tca acc tcg 144
Gln Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
35 40 45

tac act ctg gaa gat cag aac cgg gaa ttc atg gtt ttc gcc cat gct 192
Tyr Thr Leu Glu Asp Gln Asn Arg Glu Phe Met Val Phe Ala His Ala
50 55 60

caa tgg aaa gca cag gca caa agt ata ttt tct cta tta atg agt tca 240
Gln Trp Lys Ala Gln Ala Gln Ser Ile Phe Ser Leu Leu Met Ser Ser
65 70 75 80

tcc agt ggt ttt ctt gac tta ttt ttt tcg cat tgc agg agt ttg ttt 288
Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe
85 90 95

aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gca 336
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala
100 105 110

gtg ttt ctg gat cct ttt gat acc tgt ggc tta att gtt gct aaa tat 384
Val Phe Leu Asp Pro Phe Asp Thr Cys Gly Leu Ile Val Ala Lys Tyr
115 120 125

ttc tcc ctc ccc tct gtg gtc ttc acc agg gga ata ttt tgc cac cat 432
Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly Ile Phe Cys His His
130 135 140

ctt gaa gaa ggt gca cag tgc cct gct cct ctt tcc tat gtc ccc aat 480
Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn
145 150 155 160

gat ctc tta ggg ttc tca gat gcc atg act ttc aag gag aga gta tgg 528
Asp Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp
165 170 175

aac cac atc gtg cac ttg gag gac cat tta ttt tgc cag tat ctt ttt 576
Asn His Ile Val His Leu Glu Asp His Leu Phe Cys Gln Tyr Leu Phe
180 185 190

aga aat gcc cta gaa ata gcc tct gaa att ctc caa acc cct gtc acg 624
Arg Asn Ala Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
195 200 205

gca tat gat ctc tac agt cac aca tca att tgg ttg ttg cga acg gac 672
Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
210 215 220

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aac atg atc ttc att 720
Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
225 230 235 240

ggt ggt atc aac tgt cat cag gga aag cca ttg cct atg 759
Gly Gly Ile Asn Cys His Gln Gly Lys Pro Leu Pro Met
245 250

<210> 16

<211> 253 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Asp Gly Ser His Trp Phe Thr Met Gln Ser Val Val Glu Lys Leu
1 5 10 15

Ile Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp
20 25 30

Gln Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser
35 40 45

Tyr Thr Leu Glu Asp Gln Asn Arg Glu Phe Met Val Phe Ala His Ala
50 55 60

Gln Trp Lys Ala Gln Ala Gln Ser Ile Phe Ser Leu Leu Met Ser Ser
65 70 75 80

Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe
85 90 95

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala
100 105 110

Val Phe Leu Asp Pro Phe Asp Thr Cys Gly Leu Ile Val Ala Lys Tyr
115 120 125

Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly Ile Phe Cys His His
130 135 140

Leu Glu Glu Gly Ala Gln Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn
145 150 155 160

Asp Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp
165 170 175

Asn His Ile Val His Leu Glu Asp His Leu Phe Cys Gln Tyr Leu Phe
180 185 190

Arg Asn Ala Leu Glu Ile Ala Ser Glu Ile Leu Gln Thr Pro Val Thr
195 200 205

Ala Tyr Asp Leu Tyr Ser His Thr Ser Ile Trp Leu Leu Arg Thr Asp
210 215 220

Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met Ile Phe Ile
225 230 235 240

Gly Gly Ile Asn Cys His Gln Gly Lys Pro Leu Pro Met
245 250

<210> 17 

<211> 735 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(735)

<400> 17 

gaa ttt gaa gcc tac att aat gct tct gga gaa cat gga att gtg gtt 48
Glu Phe Glu Ala Tyr Ile Asn Ala Ser Gly Glu His Gly Ile Val Val
1 5 10 15

ttc tct ttg gga tca atg gtc tca gaa att cca gag aag aaa gct atg 96
Phe Ser Leu Gly Ser Met Val Ser Glu Ile Pro Glu Lys Lys Ala Met
20 25 30

gca att gct gat gct ttg ggc aaa atc cct cag aca gtc ctg tgg cgg 144
Ala Ile Ala Asp Ala Leu Gly Lys Ile Pro Gln Thr Val Leu Trp Arg
35 40 45

tac act gga acc cga cca tcg aat ctt gcg aac aac acg ata ctt gtt 192
Tyr Thr Gly Thr Arg Pro Ser Asn Leu Ala Asn Asn Thr Ile Leu Val
50 55 60

aag tgg cta ccc caa aac gat ctg ctt ggt cac ccg atg acc cgt gcc 240
Lys Trp Leu Pro Gln Asn Asp Leu Leu Gly His Pro Met Thr Arg Ala
65 70 75 80

ttt atc acc cat gct ggt tcc cat ggt gtt tat gaa agc ata tgc aat 288
Phe Ile Thr His Ala Gly Ser His Gly Val Tyr Glu Ser Ile Cys Asn
85 90 95

ggc gtt ccc atg gtg atg atg ccc ttg ttt ggt gat cag atg gac aat 336
Gly Val Pro Met Val Met Met Pro Leu Phe Gly Asp Gln Met Asp Asn
100 105 110

gca aag cgc atg gag act aag gga gct gga gtg acc ctg aat gtt ctg 384
Ala Lys Arg Met Glu Thr Lys Gly Ala Gly Val Thr Leu Asn Val Leu
115 120 125

gaa atg act tct gaa gat tta gaa aat gct cta aaa gca gtc atc aat 432
Glu Met Thr Ser Glu Asp Leu Glu Asn Ala Leu Lys Ala Val Ile Asn
130 135 140

gac aaa agt tac aag gag aac atc atg cgc ctc tcc agc ctt cac aag 480
Asp Lys Ser Tyr Lys Glu Asn Ile Met Arg Leu Ser Ser Leu His Lys
145 150 155 160

gac cgc ccg gtg gag ccg ctg gac ctg gcc gtg ttc tgg gtg gag ttt 528
Asp Arg Pro Val Glu Pro Leu Asp Leu Ala Val Phe Trp Val Glu Phe
165 170 175

gtg atg agg cac aag ggc gcg cca cac ctg cgc ccc gca gcc cac gac 576
Val Met Arg His Lys Gly Ala Pro His Leu Arg Pro Ala Ala His Asp
180 185 190

ctc acc tgg tac cag tac cat tcc ttg gac gtg att ggt ttc ctc ttg 624
Leu Thr Trp Tyr Gln Tyr His Ser Leu Asp Val Ile Gly Phe Leu Leu
195 200 205

gcc gtc gtg ctg aca gtg gcc ttc atc acc ttt aaa tgt tgt gct tat 672
Ala Val Val Leu Thr Val Ala Phe Ile Thr Phe Lys Cys Cys Ala Tyr
210 215 220

ggc tac cgg aaa tgc ttg ggg aaa aaa ggg cga gtt aag aaa gcc cac 720
Gly Tyr Arg Lys Cys Leu Gly Lys Lys Gly Arg Val Lys Lys Ala His
225 230 235 240

aaa tcc aag acc cat 735
Lys Ser Lys Thr His
245



<210> 18 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Glu Phe Glu Ala Tyr Ile Asn Ala Ser Gly Glu His Gly Ile Val Val
1 5 10 15

Phe Ser Leu Gly Ser Met Val Ser Glu Ile Pro Glu Lys Lys Ala Met
20 25 30

Ala Ile Ala Asp Ala Leu Gly Lys Ile Pro Gln Thr Val Leu Trp Arg
35 40 45

Tyr Thr Gly Thr Arg Pro Ser Asn Leu Ala Asn Asn Thr Ile Leu Val
50 55 60

Lys Trp Leu Pro Gln Asn Asp Leu Leu Gly His Pro Met Thr Arg Ala
65 70 75 80

Phe Ile Thr His Ala Gly Ser His Gly Val Tyr Glu Ser Ile Cys Asn
85 90 95

Gly Val Pro Met Val Met Met Pro Leu Phe Gly Asp Gln Met Asp Asn
100 105 110

Ala Lys Arg Met Glu Thr Lys Gly Ala Gly Val Thr Leu Asn Val Leu
115 120 125

Glu Met Thr Ser Glu Asp Leu Glu Asn Ala Leu Lys Ala Val Ile Asn
130 135 140

Asp Lys Ser Tyr Lys Glu Asn Ile Met Arg Leu Ser Ser Leu His Lys
145 150 155 160

Asp Arg Pro Val Glu Pro Leu Asp Leu Ala Val Phe Trp Val Glu Phe
165 170 175

Val Met Arg His Lys Gly Ala Pro His Leu Arg Pro Ala Ala His Asp
180 185 190

Leu Thr Trp Tyr Gln Tyr His Ser Leu Asp Val Ile Gly Phe Leu Leu
195 200 205

Ala Val Val Leu Thr Val Ala Phe Ile Thr Phe Lys Cys Cys Ala Tyr
210 215 220

Gly Tyr Arg Lys Cys Leu Gly Lys Lys Gly Arg Val Lys Lys Ala His
225 230 235 240

Lys Ser Lys Thr His
245

<210> 19 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 19 
tggtgtatcg attggtttt 

<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 20 
catatatctg gggctagtta atc

<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 21 
acaaggtaat taagatgaag aaagca 

<210> 22 
<211> 20
<212> DNA

<213> Artificial Sequence
<220> 

<223> Primer 

<400> 22 
acctgagata gtggcttcct 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 23 
tttgtcttcc aattacatgc 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 24 
agtagatatg gaagcacttg taag 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 25 
tctcagtgac aaggtaatta agac 

<210> 26 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 
cattgattgg ataaaggca 

<210> 27 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 27 
aatttgggtt cttacatatc aa

<210> 28
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 28 
gagtgaggga ggacagag 

<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 29 
ataagtacac gccttctttt g 

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 30 
gctgctttat acaatttgct ac 

<210> 31 

<211> 22 

<212> DNA

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 31 
cgcctacgta tcatagcagt ta 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 32 
ggaaagaaat ttgaaatgca ac 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence

<220> 

<223> Primer 
<400> 33
tctttccgcc tactgtatca

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 34 
ttcaagaagg gcagttttat 

<210> 35 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 35 
ctctggcagg agcaaag 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 36 
atacacacct gggatagtgg 

<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 37 
ggtaattaag atgaagaaag ca 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 38 
ctgagatagt ggcttcctg 

<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer

<400> 39
gtggctcaat gacaagg 

<210> 40 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 40 
atatggaagc acttgtaagt aaa 

<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 41 
ttaagacgaa ggaaacaatt ct 

<210> 42 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 42 
acctgagata gtggcttcc 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 43 
atcaaagggt aaaattcaga 

<210> 44 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 44 
ggcagtccaa aagaaata 

<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer

<400> 45 
ttttgagggc aggttcta 

<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 46 
aatgggacaa atgtaaatga ta 

<210> 47 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 47 
ttctctcatg gctcgca 

<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 48 
atgtcaaatc acaattcagt aagg 

<210> 49 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 49 
ccgcctactg tatcatagca 

<210> 50 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 50 
caacgaaatg tcaaatcaca g 

<210> 51 
<211> 17 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Primer 

<400> 51 
ctctggcagg agcaaag 

<210> 52 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 52 
acagtgggca gagacag 

<210> 53 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 53 
gtggtttatt ccccgtat 

<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 
atacacacct gggatagtgg 

<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 55 
ggtaattaag atgaagaaag ca 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 56 
gaaatggcat aggttgtc 

<210> 57 
<211> 17 
<212> DNA

<213> Artificial Sequence

<220> 

<223> Primer 

<400> 57 
ggccacactc aactgta 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 58 
ctcaaaaaaa acacagtagg 

<210> 59 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 59
actttttctg ccccttat 

<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 60 
atatggaagc acttgtaagt aaa 

<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 61 
ttaagacgaa ggaaacaatt ct 

<210> 62 
<211> 17 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Primer 

<400> 62 
aatggcatac gttgtca

<210> 63
<211> 19

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 63 
agaatggcaa ttatgaaca 

<210> 64 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Primer 

<400> 64 
tgtgtgccct taaagtct 

<210> 65 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 65 
agaatggcaa ttatgaaca 

<210> 66 
<211> 19 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Primer 

<400> 66 
acctgagata gtggcttcc 

<210> 67 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 67 
ctctggctct gtcctac 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 68
acctgagata gtggcttcc

<210> 69
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 69 
atcaaagggt aaaattcaga 

<210> 70 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 70 
cagcagcttg tcacctac 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 71 
aatttgcttt tgaaagaatc 

<210> 72 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 72 
ggtaggccca aatactca 

<210> 73 
<211> 20
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 73 
aatttgcttt tgaaagaatc 

<210> 74 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer

<400> 74
ggcagtccaa aagaaata 

<210> 75 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 75 
ttttgagggc aggttcta 

<210> 76 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 76 
cacctctggc atgactac 

<210> 77 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 77 
ttgcaggagt ttgtttaat 

<210> 78 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 78 
aatgggacaa atgtaaatga ta 

<210> 79 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 79 
cattgcagga gtttgttta 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer

<400> 80 
catctgagaa ccctaagaga 

<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 81 
agaaatagcc tctgaaattc 

<210> 82 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 82 
atgtcaaatc acaattcagt aagg 

<210> 83
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 83 
ccgcctactg tatcatagca 

<210> 84 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 84 
gagtgtacga ggttgagtaa g 

<210> 85 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 85 
attttgccag tatcttttta g 

<210> 86 
<211> 21 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Primer 

<400> 86 
caacgaaatg tcaaatcaca g 

<210> 87 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 87 
catcagagac agagcatttt acacctt 

<210> 88 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 88
ggacctattg agccctgcat ctgtct 

<210> 89 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 89 
ggttcccctg ccgcggctgg ccaca 

<210> 90

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 90 
gccctgggct gaaagtggaa ag 

<210> 91 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 91 
atgcgggagg ccttgcggga gct

<210> 92

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 92 
ctctgcgcgg cggtgctggc taag 

<210> 93 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 93 
taccccaggc caatcatgcc caaca 

<210> 94 
<211> 27

<212> DNA

<213> Homo sapiens 

<400> 94 
tccaggcaaa atacttttta aaaaatg 

<210> 95 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 95 
agcatgcggg aggcctcgcg gga 

<210> 96 

<211> 21 

<212> DNA 

<213> Homo sapiens

<400> 96 
gcgggagctc catgcgagag g 

<210> 97 
<211> 25 
<212> DNA 

<213> Homo sapiens

<400> 97 
tggtggtcct caccccggag gtgaa 

<210> 98 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 98 
tacatcaaag aggagaactt tttcac 

<210> 99 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 99 
tgatcaggca cctgaatgct acttcc 

<210> 100 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 100 
acctctgcgg ggcggtgctg g 

<210> 101 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 101 
aagaacatgc tttaccctct ggc

<210> 102 
<211> 18

<212> DNA

<213> Homo sapiens 

<400> 102 
ctctggctct gtcctacc 

<210> 103 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 103 

tcctaccttt gctatgctgt ttct 24 

<210> 104 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 104 

tgtcagtggt ggatatt 17

<210> 105 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 105 

ggtggatatt ctcagc 16

<210> 106 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 106 

tcagctatgc atc 13

<210> 107 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 107 

gcatccgtgt ggctgttccg a 21 

<210> 108 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 108 

tggctgttcc gacgggactt 20

<210> 109 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 109 
gggacttcgt gatgga 

<210> 110 
<211> 23

<212> DNA

<213> Homo sapiens 

<400> 110 
gtgatggact accccaggcc gat 

<210> 111 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 111 
cctgcctcct tcgcgcattt cagag 

<210> 112 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 112 
gcgatcattc ctgactgctc ctcag 

<210> 113 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 113 
ccctggagca tgcattcagc ag 

<210> 114 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 114 
cattcagcag cagcccagac cct 

<210> 115 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 115 
tacttcttcc acgtactata tta 

<210> 116 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 116 
ggcctccttc cactatatgt gtgt 

<210> 117 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 117 
ggagagagta cggaaccaca t 



<210> 118 
<211> 23

<212> DNA

<213> Homo sapiens 

<400> 118 
tcaatttggt tattgcgaac tga 

<210> 119 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 119 
caggggaata gcttgccact at 

<210> 120 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 120 
tgttgcgaac ggactttgtt ttgg 

<210> 121 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 121 
ttcaccagca atcggtggtg g 

<210> 122 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 122 
ctagaaatag cttctgaaat tctcc 

<210> 123 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 123 
cggcatatga tatctacagt caca 

<210> 124 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 124 
tcaatttggt tgctgcgaac aggac 